Genes associated with mast cell activation

ABSTRACT

The invention relates generally to the changes in gene expression in mast cells and tissues removed from patients with allergic hypersensitivity. The invention specifically relates to the genes MC21, MC22, MC25, MC33, MC36, and MC39, which are differentially expressed in mast cells compared to normal tissues and in resting mast cells versus activated mast cells.

FIELD OF THE INVENTION

[0001] The invention relates generally to the changes in gene expression in mast cells and tissues removed from patients with allergic hypersensitivity. The invention specifically relates to the genes MC21, MC22, MC25, MC33, MC36 and MC39, which are differentially expressed in mast cells compared to normal tissues and in resting mast cells versus activated mast cells.

BACKGROUND OF THE INVENTION

[0002] Allergic Hypersensitivity

[0003] The inflammatory response characteristic of allergic or hypersensitivity reactions can be elicited by extrinsic antigens such as pollen, dust, food, and chemicals in the environment. There are four main classes of hypersensitivity reactions, which are distinguished by the type of immune cells and antibodies involved and the pathologies produced. In the most common IgE-dependent allergic reactions, the inflammatory response involves mast cell degranulation, or emptying of the granules, triggered by allergen interaction with IgE molecules on the mast cell surface. Present in large numbers in epithelial tissue, mast cells have high-affinity IgE receptors on their surface.

[0004] Inhaled allergens initiate respiratory allergies such as allergic rhinitis, hay fever and asthma, while ingested allergens may cause food allergies. Injected allergens, such as antibiotics and insect venoms, may cause life-threatening anaphylactic reactions.

[0005] The cytoplasmic granules of the mature mast cell contain mediators of allergies, such as histamine, heparin, and proteases. Newly formed lipid mediators, such as leukotrienes and prostaglandins are also rapidly synthesized and released from mast cells upon activation. Small quantities of these mediators released on the mucosal surfaces of the respiratory system cause the symptoms associated with allergic rhinitis for example itchy watery eyes, runny nose, and sneezing. Many of the symptoms associated with asthma are directly associated with the effects of mediators released from the mast cell as well as indirectly through other cells that are recruited to the lung by these mediators. Larger amounts of these mediators contract the respiratory smooth muscles, limit breathing, and may cause an anaphylactic reaction which can be fatal. Release of these mediators in the skin causes edema, erythema, and wheal formation, i.e., urticaria.

[0006] Urticaria is a skin condition characterized by the appearance of intensely itching wheals or welts with elevated centers and a surrounding area of erythema (redness). Wheals are usually distributed over the trunk and extremities of the body, but they may occur on any epithelial or mucosal surface. Urticaria may be acute, lasting six weeks or less, or chronic. A related skin condition, angiodema, with similar, but non-pruritic, sores, affects deeper levels of skin tissues. It is estimated that 10-20% of the population, usually children, suffers from urticaria or angiodema or both simultaneously at one time or another (Frank, M. M., Cecil Textbook of Medicine, 20^(th) Ed., Bennet and Plum eds., chap. 19, pp. 1408-1412, W. B. Saunders Co., Philadelphia, 1996). The most common and most effective treatment to date is administration of antihistamines or glucocorticoids, and in some cases epinephrine. Where antihistamines prove ineffective, the symptoms persist for years, or even decades. In about 70% of urticaria/angiodema cases, the cause of the disease is not found. Typical sources, however, are ingested allergens from foods or drugs, such as non-steroidal anti-inflammatories, or autoimmune-type allergens, such as antithyroid, anti-IgG or anti-IgE autoantibodies. Other causes include physical factors, e.g., heat, cold, pressure and sun.

[0007] Treatment of Allergies and Asthma

[0008] Conventional therapeutic compounds, such as antihistamines (H1 antagonists), prevent histamine from binding to H1 receptors. Antihistamines may be taken to block the effect of histamine released from mast cell granules, but they have no effect on the activities of the other co-released vasoactive compounds. Cromolyn sodium and nedocromil are effective in some patients and may block mast cell degranulation and, therefore, the release of histamine as well as other mediators from mast cells. Lipoxygenase inhibitors or leukotriene antagonists may specifically block the effects of leukotrienes released from mast cells. Other agents such as glucocorticoids, theophylline, and beta-agonists play important roles in the control of asthma. They have all been shown to have an inhibitory effect on either mast cell development or activation. A new approach of targeting IgE directly with monoclonal-antibody therapy has been found to be effective in some patients with rhinitis as well as asthma and has demonstrated the importance of this trigger on the mast cell in these diseases.

[0009] Cellular and Molecular Events Involved in Mast Cell Function and Activation

[0010] It has been demonstrated that within activated mast cells, the expression levels of a number of genes are changed compared to unactivated cells. Generally, however, little is known about the cellular and molecular events associated with mast cell degranulation and regranulation of mast cells, or about the events and stages of cellular maturation that lead to a functionally mature mast cell.

[0011] While the changes in the expression levels of a number of individual genes have been identified, the investigation of the global changes in gene expression has not been reported in human mast cells. Accordingly, there exists a need for the investigation of the changes in gene expression levels as well as the need for the identification of new molecular markers associated with mast cell maturation, activation, degranulation and regranulation. Furthermore, if intervention is expected to be successful preventing or reducing allergic hypersensitivity, means of accurately assessing mast cell maturation and the early stages of activation need to be established. One way to accurately assess the early cellular events involved in mast cell activation is to identify markers which are uniquely associated with the process. Likewise, the development of therapeutics to prevent or stop IgE-mediated allergic reactions relies on the identification of genes responsible for mast cell maturation, activation and de- and regranulation.

SUMMARY OF THE INVENTION

[0012] The present invention is based on the discovery of new gene families that are differentially expressed in mast cells that have been activated through the IgE receptor. These exhibit a high level of expression in resting mast cells as compared to non-mast cell populations or whole tissues.

[0013] The invention includes isolated nucleic acid molecules selected from the group consisting of an isolated nucleic acid molecule that encodes the amino acid sequence of SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, or 16; an isolated nucleic acid molecule that encodes a fragment of at least 6 amino acids of SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, or 16; an isolated nucleic acid molecule which hybridizes to the complement of a nucleic acid molecule comprising SEQ ID NO: 1, 3, 5, 7, 9, 11, 13, or 15; and an isolated nucleic acid molecule which hybridizes to the complement of a nucleic acid molecule that encodes the amino acid sequence of SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, or 16. Nucleic acid molecules of the invention may encode a protein having at least about 35%, 40%, 50%, 60%, or 65% amino acid sequence identity to SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, or 16, preferably at least about 70% or 75% sequence identity, more preferably at least about 80-85% sequence identity, even more preferably at least about 90% and most preferably 95% sequence identity to SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, or 16.

[0014] The present invention further includes the nucleic acid molecules operably linked to one or more expression control elements, including vectors comprising the isolated nucleic acid molecules. The invention further includes host cells transformed to contain the nucleic acid molecules of the invention and methods for producing a protein comprising culturing a host cell transformed with a nucleic acid molecule of the invention under conditions in which the protein is expressed.

[0015] The invention further provides an isolated polypeptide selected from the group consisting of an isolated polypeptide comprising the amino acid sequence of SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, or 16, an isolated polypeptide comprising a functional or antigenic fragment of at least 6 amino acids of SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, or 16, an isolated polypeptide comprising conservative amino acid substitutions of SEQ ID NO: 24, 6, 8, 10, 12, 14, or 16, and an isolated polypeptide comprising naturally occurring amino acid sequence variants of SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, or 16. Polypeptides of the invention also include polypeptides with an amino acid sequence having at least about 35%, 40%, 50%, 60%, 65%, 70% or 75% amino acid sequence identity with the sequence set forth in SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, or 16; more preferably at least about 80%, even more preferably at least about 90%, and most preferably at least about 95% sequence identity with the full length sequence set forth in SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, or 16.

[0016] The invention further provides an isolated antibody or antigen binding fragment that specifically binds to a polypeptide of the invention, including monoclonal and polyclonal antibodies.

[0017] The invention further provides methods of identifying an agent which modulates the expression of a nucleic acid encoding a protein of the invention, comprising: exposing cells which express the nucleic acid to the agent; and determining whether the agent modulates expression of said nucleic acid, thereby identifying an agent which modulates the expression of a nucleic acid encoding the protein.

[0018] The invention further provides methods of identifying an agent which modulates at least one activity of a protein of the invention, comprising: exposing cells which express the protein to the agent; and determining whether the agent modulates at least one activity of said protein, thereby identifying an agent which modulates at least one activity of the protein.

[0019] The invention further provides methods of identifying binding partners for a protein of the invention, comprising: exposing said protein to a potential binding partner; and determining if the potential binding partner binds to said protein, thereby identifying binding partners for the protein.

[0020] The present invention further provides methods of modulating the expression of a nucleic acid encoding a protein of the invention, comprising administering an effective amount of an agent which modulates the expression of a nucleic acid encoding the protein. The invention also provides methods of modulating at least one activity of a protein of the invention, comprising administering an effective amount of an agent which modulates at least one activity of the protein.

[0021] The present invention further includes non-human transgenic animals modified to contain the nucleic acid molecules of the invention or mutated nucleic acid molecules such that expression of the encoded polypeptides of the invention is prevented.

[0022] The invention further provides methods of diagnosing states of IgE-mediated hypersensitivity, urticaria or mastocytosis comprising determining the level of expression of a nucleic acid molecule of the invention or polypeptide of the invention.

BRIEF DESCRIPTION OF THE DRAWINGS

[0023]FIG. 1 shows the relative expression of AF150143 (MC21) using the Affymetrix human GeneChip 95K chip set in samples from cultured human mast cells, isolated human hematopoietic cells, and various normal human tissues.

[0024]FIG. 2 shows a Northern blot, in which the expression level of a 1.79 kb mRNA species corresponding to SEQ ID NO: 1 (clone BD9 (MC21)) was measured in human cultured mast cells or tissues. Lane 1) activated mast cells, Lane 2) resting mast cells, Lane 3) kidney, Lane 4) fetal liver, Lane 5) heart, Lane 6) placenta.

[0025]FIG. 3 shows a Kile-Doolittle Hydropathy plot of the amino acid sequence of the encoded MC21 protein.

[0026]FIG. 4 shows a Parker antigenicity plot of the amino acid sequence of the encoded MC21 protein.

[0027]FIG. 5 shows the relative expression of MC22 using the Affymetrix human GeneChip 95K chip set in samples from cultured human mast cells, isolated human hematopoietic cells, and various normal human tissues.

[0028]FIG. 6 shows the relative expression of MC25 using the Affymetrix human GeneChip 95K chip set in samples from cultured human mast cells, isolated human hematopoietic cells, and various normal human tissues.

[0029]FIG. 7 shows the relative expression of MC33 using the Affymetrix human GeneChip 95K chip set in samples from cultured human mast cells, isolated human hematopoietic cells, and various normal human tissues.

[0030]FIG. 8 shows the relative expression of MC36 using the Affymetrix human GeneChip 95K chip set in samples from cultured human mast cells, isolated human hematopoietic cells, and various normal human tissues.

[0031]FIG. 9 shows the relative expression of MC39 using the Affymetrix human GeneChip 95K chip set in samples from cultured human mast cells, isolated human hematopoietic cells, and various normal human tissues.

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENT

[0032] I. General Description

[0033] The present invention is based, in part, on the identification of a new gene family that is highly expressed in mast cells, with limited expression in other cell types or tissues. This gene family corresponds to the human cDNA of SEQ ID NO: 1. Genes that encode the human protein of SEQ ID NO: 2 may also be found in other animal species, particularly mammalian species.

[0034] MC21 is initially encoded with a signal peptide which spans amino acid residues 1-38 of SEQ ID NO: 2. The mature MC21 protein is a transmembrane protein which spans amino acid residues 39 to 323 of SEQ ID NO: 2. The extracellular domain of mature MC21 comprises a V-set Ig domain which may bind sialic acid, at about amino acid residues 42 to 161 and a C2 immunoglobulin domain at about amino acids 162 to 254 of SEQ ID NO: 2. The transmembrane domain of MC21 is found at about amino acid residues 279 to 301 of SEQ ID NO: 2 and MC21 has a short cytoplasmic tail from about amino acid 302 to 323 of SEQ ID NO: 2. It is notable that MC21 has a lysine residue within the transmembrane domain at amino acid position 290 of SEQ ID NO: 2, a feature common to proteins which are associated within the transmembrane domain with another protein domain. Three cDNAs have been identified which contain part or all of the MC21 open reading frame, BA4, BD9 and OB4. OB4 is extended 5′ versus the other 2 cDNA clones and contains the entire MC21 open reading frame.

[0035] The present invention further includes additional new genes that are highly expressed in mast cells, with limited expression in other cell types or tissues. These genes correspond to the human cDNA sequences termed MC22.1 (SEQ ID NO: 3), which encodes the polypeptide of SEQ ID NO: 4; MC22.2 (SEQ ID NO: 5), which encodes the polypeptide of SEQ ID NO: 6; MC25 (SEQ ID NO: 7), which encodes the polypeptide of SEQ ID NO: 8; MC33 (SEQ ID NO: 9), which encodes the polypeptide of SEQ ID NO: 10; MC36 (SEQ ID NO: 11), which encodes the polypeptide of SEQ ID NO: 12; MC39 (first open reading frame (ORF); SEQ ID NO: 13), which encodes the polypeptide of SEQ ID NO: 14; and MC39 (second open reading frame (ORF); SEQ ID NO: 15), which encodes the polypeptide of SEQ ID NO: 16. Genes that encode the human proteins supra may also be found in other animal species, particularly mammalian species.

[0036] MC22, which has two splice variants termed here MC22.1 (SEQ ID NO: 3) and MC22.2 (SEQ ID NO: 5), is a human ortholog of the rat protein tomosyn with 89% identity. Tomosyn is a syntaxin- 1-binding protein that forms a complex in the neurotransmitter release process (Fujita, Y. et al. (1998) Neuron 20(5):905-915).

[0037] MC21, MC22, MC25, MC33, MC36 and MC39 may be used individually or in combination of at least two members of the group as markers to detect, diagnose or identify an allergic response in a patient. MC21, MC22, MC25, MC33, MC36 and MC39 may also be used individually or in combination of at least two members of the group as markers to detect, determine or identify the state of mast cell activation, i.e., to determine whether a mast cell is in an activated or resting state. The proteins can also serve as targets for agents that modulate gene or protein expression or activity. For example, agents may be identified that modulate biological processes associated with mast cell function, including mast cell degranulation that leads to urticaria.

[0038] II. Specific Embodiments

[0039] A. The Proteins Associated with Mast Cell Regranulation and/or Allergic Hypersensitivity

[0040] The present invention includes isolated proteins, allelic variants of the proteins, and conservative amino acid substitutions of the proteins. As used herein, the “protein” or “polypeptide” refers, in part, to MC21, a protein that has the human amino acid sequence depicted in SEQ ID NO: 2 or fragments thereof; to MC22, which includes the splice variants MC22.1 (a protein that has the human amino acid sequence depicted in SEQ ID NO: 4 or fragments thereof) and MC22.2 (a protein that has the human amino acid sequence depicted in SEQ ID NO: 6 or fragments thereof); to MC25, a protein that has the human amino acid sequence depicted in SEQ ID NO: 8 or fragments thereof; to MC33, a protein that has the human amino acid sequence depicted in SEQ ID NO: 10 or fragments thereof; to MC36, a protein that has the human amino acid sequence depicted in SEQ ID NO: 12 or fragments thereof; or to MC39, including the first open reading frame (a protein that has the human amino acid sequence depicted in SEQ ID NO: 14 or fragments thereof) and the second open reading frame (a protein that has the human amino acid sequence depicted in SEQ ID NO: 16 or fragments thereof). The terms also refer to naturally occurring allelic variants and proteins that have a slightly different amino acid sequence than that specifically recited above. Allelic variants, though possessing a slightly different amino acid sequence than those recited above, will still have the same or similar biological functions associated with these proteins.

[0041] As used herein, the family of proteins related to the human amino acid sequence of SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, or 16 refers in part, to proteins that have been isolated from organisms in addition to humans. The methods used to identify and isolate other members of the family of proteins related to these proteins are described below.

[0042] The proteins of the present invention are preferably in isolated form. As used herein, a protein is said to be isolated when physical, mechanical or chemical methods are employed to remove the protein from cellular constituents that are normally associated with the protein. A skilled artisan can readily employ standard purification methods to obtain an isolated protein.

[0043] The proteins of the present invention further include insertion, deletion, conservative amino acid substitution or splice variants of SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, or 16. As used herein, a conservative variant refers to alterations in the amino acid sequence that do not adversely affect the biological functions of the protein. A substitution, insertion or deletion is said to adversely affect the protein when the altered sequence prevents or disrupts a biological function associated with the protein. For example, the overall charge, structure or hydrophobic/hydrophilic properties of the protein can be altered without adversely affecting a biological activity. Accordingly, the amino acid sequence can be altered, for example to render the peptide more hydrophobic or hydrophilic, without adversely affecting the biological activities of the protein.

[0044] Ordinarily, the allelic variants, the conservative substitution variants, and the members of the protein family, will have an amino acid sequence having at least about 35%, 40%, 50%, 60%, 65%, 70% or 75% amino acid sequence identity with the full length sequence set forth in SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, or 16, more preferably at least about 80%, even more preferably at least about 90%, and most preferably at least about 95%, 97% or 99% sequence identity. Identity or homology with respect to such sequences is defined herein as the percentage of amino acid residues in the candidate sequence that are identical with the known peptides, after aligning the sequences and introducing gaps, if necessary, to achieve the maximum percent homology, and not considering any conservative substitutions as part of the sequence identity (see section B for the relevant parameters). Fusion proteins, or N-terminal, C-terminal or internal extensions, deletions, or insertions into the peptide sequence shall not be construed as affecting homology.

[0045] Thus, the proteins of the present invention include molecules having the amino acid sequence disclosed in SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, or 16; fragments thereof preferably having a consecutive sequence of SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, or 16 of at least about 6 or 10, 15 or 20, or 25 or 30 amino acid residues, more preferably 35 or 40 amino acid residues, even more preferably 45 or 50 amino acid residues, yet more preferably 55 or 60, still more preferably 65 or 70 amino acid residues and most preferably at least 75 or more amino acid residues; amino acid sequence variants wherein one or more amino acid residues has been inserted N- or C-terminal to, or within, the disclosed coding sequence; and amino acid sequence variants of the disclosed sequence, or their fragments as defined above, that have been substituted by at least one residue. Such fragments, also referred to as peptides or polypeptides, may contain antigenic regions, functional regions of the protein identified as regions of the amino acid sequence which correspond to known protein domains, as well as regions of pronounced hydrophilicity. The regions are all easily identifiable by using commonly available protein sequence analysis software such as MacVector (Oxford Molecular).

[0046] Proteins of the invention further include the mature MC21 protein without the signal sequence. For example, the invention includes a polypeptide consisting of or comprising about amino acid residues 39-323 of SEQ ID NO: 2. Fragments of the MC21 protein encompassed by the present invention also include the extracellular domain attached to the transmembrane domain, but without the signal peptide or the cytoplasmic tail. For example, the invention includes a polypeptide consisting of or comprising about amino acid residues 39 to 301 of SEQ ID NO: 2. The invention also includes a soluble form of MC21 consisting essentially of the extracellular domain of the protein. For example, the invention includes a polypeptide consisting of or comprising about amino acid residues 39 to 278 of SEQ ID NO: 2. The invention further includes variant, altered or mutant forms of the mature MC21 protein and of the fragments thereof which have amino acid deletions, additions or substitutions to the amino acid sequence of SEQ ID NO: 2. Said amino acid deletions, additions or substitutions may be silent, not changing the biological and/or immunological function or properties of MC21, such as a conservative substitution. Said amino acid deletions, additions or substitutions may also destroy or ablate the biological and/or immunological function or properties of MC21. Further, said amino acid deletions, additions or substitutions may enhance or diminish the biological and/or immunological function or properties of MC21.

[0047] Proteins of the invention further include the MC22.1 polypeptide. For example, the invention includes a polypeptide consisting of or comprising about amino acid residues 1-1151 of SEQ ID NO: 4. The invention further includes variant, altered or mutant forms of the MC22.1 protein and of the fragments thereof which have amino acid deletions, additions or substitutions to the amino acid sequence of SEQ ID NO: 4. Said amino acid deletions, additions or substitutions may be silent, not changing the biological and/or immunological function or properties of MC22.1, such as a conservative substitution. Said amino acid deletions, additions or substitutions may also destroy or ablate the biological and/or immunological function or properties of MC22.1. Further, said amino acid deletions, additions or substitutions may enhance or diminish the biological and/or immunological function or properties of MC22.1.

[0048] Proteins of the invention further include the MC22.2 polypeptide. For example, the invention includes a polypeptide consisting of or comprising about amino acid residues 1- 1115 of SEQ ID NO: 6. The invention further includes variant, altered or mutant forms of the MC22.2 protein and of the fragments thereof which have amino acid deletions, additions or substitutions to the amino acid sequence of SEQ ID NO: 6. Said amino acid deletions, additions or substitutions may be silent, not changing the biological and/or immunological function or properties of MC22.2, such as a conservative substitution. Said amino acid deletions, additions or substitutions may also destroy or ablate the biological and/or immunological function or properties of MC22.2. Further, said amino acid deletions, additions or substitutions may enhance or diminish the biological and/or immunological function or properties of MC22.2.

[0049] Proteins of the invention further include the MC25 protein. For example, the invention includes a polypeptide consisting of or comprising about amino acid residues 1-1259 of SEQ ID NO: 8. The invention further includes variant, altered or mutant forms of the MC25 protein and of the fragments thereof which have amino acid deletions, additions or substitutions to the amino acid sequence of SEQ ID NO: 8. Said amino acid deletions, additions or substitutions may be silent, not changing the biological and/or immunological function or properties of MC25, such as a conservative substitution. Said amino acid deletions, additions or substitutions may also destroy or ablate the biological and/or immunological function or properties of MC25. Further, said amino acid deletions, additions or substitutions may enhance or diminish the biological and/or immunological function or properties of MC25.

[0050] Proteins of the invention further include the MC33 protein. For example, the invention includes a polypeptide consisting of or comprising about amino acid residues 1-350 of SEQ ID NO: 10. The invention further includes variant, altered or mutant forms of the MC33 protein and of the fragments thereof which have amino acid deletions, additions or substitutions to the amino acid sequence of SEQ ID NO: 10. Said amino acid deletions, additions or substitutions may be silent, not changing the biological and/or immunological function or properties of MC33, such as a conservative substitution. Said amino acid deletions, additions or substitutions may also destroy or ablate the biological and/or immunological function or properties of MC33. Further, said amino acid deletions, additions or substitutions may enhance or diminish the biological and/or immunological function or properties of MC33.

[0051] Proteins of the invention further include the MC36 protein. For example, the invention includes a polypeptide consisting of or comprising about amino acid residues 1-398 of SEQ ID NO: 12. The invention further includes variant, altered or mutant forms of the MC36 protein and of the fragments thereof which have amino acid deletions, additions or substitutions to the amino acid sequence of SEQ ID NO: 12. Said amino acid deletions, additions or substitutions may be silent, not changing the biological and/or immunological function or properties of MC36, such as a conservative substitution. Said amino acid deletions, additions or substitutions may also destroy or ablate the biological and/or immunological function or properties of MC36. Further, said amino acid deletions, additions or substitutions may enhance or diminish the biological and/or immunological function or properties of MC36.

[0052] Proteins of the invention further include the polypeptide encoded by the MC39 first ORF polypeptide. For example, the invention includes a polypeptide consisting of or comprising about amino acid residues 1-107 of SEQ ID NO: 14. The invention further includes variant, altered or mutant forms of the MC39 first ORF polypeptide and of the fragments thereof which have amino acid deletions, additions or substitutions to the amino acid sequence of SEQ ID NO: 14. Said amino acid deletions, additions or substitutions may be silent, not changing the biological and/or immunological function or properties of the MC39 first ORF polypeptide, such as a conservative substitution. Said amino acid deletions, additions or substitutions may also destroy or ablate the biological and/or immunological function or properties of the MC39 first ORF polypeptide. Further, said amino acid deletions, additions or substitutions may enhance or diminish the biological and/or immunological function or properties of the MC39 first ORF polypeptide.

[0053] Proteins of the invention further include the polypeptide encoded by the MC39 second ORF polypeptide. For example, the invention includes a polypeptide consisting of or comprising about amino acid residues 1-83 of SEQ ID NO: 16. The invention further includes variant, altered or mutant forms of the MC39 second ORF polypeptide and of the fragments thereof which have amino acid deletions, additions or substitutions to the amino acid sequence of SEQ ID NO: 16. Said amino acid deletions, additions or substitutions may be silent, not changing the biological and/or immunological function or properties of the MC39 second ORF polypeptide, such as a conservative substitution. Said amino acid deletions, additions or substitutions may also destroy or ablate the biological and/or immunological function or properties of the MC39 second ORF polypeptide. Further, said amino acid deletions, additions or substitutions may enhance or diminish the biological and/or immunological function or properties of the MC39 second ORF polypeptide.

[0054] Contemplated variants further include those containing predetermined mutations by, e.g., homologous recombination, site-directed or PCR mutagenesis, and the corresponding proteins of other animal species, including but not limited to rabbit, mouse, rat, porcine, bovine, ovine, equine and non-human primate species, and the alleles or other naturally occurring variants of the family of proteins; and derivatives wherein the protein has been covalently modified by substitution, chemical, enzymatic, or other appropriate means with a moiety other than a naturally occurring amino acid (for example a detectable moiety such as an enzyme or radioisotope).

[0055] As described below, members of the family of proteins can be used: (1) as a diagnostic marker; (2) to identify agents which modulate at least one activity of the protein; (3) to identify binding partners for the protein, (4) as an antigen to raise polyclonal or monoclonal antibodies, and (5) as a therapeutic agent or target.

[0056] B. Nucleic Acid Molecules

[0057] The present invention further includes nucleic acid molecules that encode the protein having SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, or 16 and the related proteins herein described, preferably in isolated form. As used herein, “nucleic acid” is defined as RNA or DNA or related molecules that encodes a protein or peptide as defined above, is complementary to a nucleic acid sequence encoding such peptides, hybridizes to such a nucleic acid and remains stably bound to it under appropriate stringency conditions, or encodes a polypeptide sharing at least about 35%, 40%, 50%, 60%, 65%, 70% or 75% sequence identity, preferably at least about 80%, more preferably at least about 85%, and even more preferably at least about 90%, 95%, 97% or 99% or more identity with the full-length peptide sequence of SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, or 16. The “nucleic acid molecules” of the invention further include nucleic acid molecules that share at least about 70% or 75% sequence identity, preferably at least about 80%, more preferably at least about 85%, and even more preferably at least about 90% and most preferably 95%, 97%, 99% or more identity with the nucleotide sequence of SEQ ID NO: 1, 3, 5, 7, 9, 11, 13, or 15 or the full length of the open reading frame defined therein. Specifically contemplated are genomic DNA, cDNA, mRNA and antisense molecules, as well as nucleic acids based on alternative backbones or including alternative bases whether derived from natural sources or synthesized. Such nucleic acids, however, are defined further as being novel and unobvious over any prior art nucleic acid including that which encodes, hybridizes under appropriate stringency conditions, or is complementary to nucleic acid encoding a protein according to the present invention.

[0058] Homology or identity at the nucleotide or amino acid sequence level is determined by BLAST (Basic Local Alignment Search Tool) analysis using the algorithm employed by the programs blastp, blastn, blastx, tblastn and tblastx (Altschul, S. F. et al., Nucleic Acids Res 25: 3389-3402 (1997) and Karlin et al., Proc Natl Acad Sci USA 87:2264-2268 (1990), both fully incorporated by reference) which are tailored for sequence similarity searching. The approach used by the BLAST program is to first consider similar segments, with and without gaps, between a query sequence and a database sequence, then to evaluate the statistical significance of all matches that are identified and finally to summarize only those matches which satisfy a pre-selected threshold of significance. For a discussion of basic issues in similarity searching of sequence databases, see Altschul et al., (1994) (Nature Genetics 6, 119-129) which is fully incorporated by reference. The search parameters for histogram, descriptions, alignments, expect (i.e., the statistical significance threshold for reporting matches against database sequences), cutoff, matrix and filter (low complexity) are at the default settings. The default scoring matrix used by blastp, blastx, tblastn, and tblastx is the BLOSUM62 matrix (Henikoffet al., (1992) Proc. Natl. Acad. Sci. USA 89, 10915-10919, fully incorporated by reference), recommended for query sequences over 85 in length (nucleotide bases or amino acids).

[0059] For blastn, the scoring matrix is set by the ratios of M (i.e., the reward score for a pair of matching residues) to N (i.e., the penalty score for mismatching residues), wherein the default values for M and N are +5 and −4, respectively. Four blastn parameters were adjusted as follows: Q=10 (gap creation penalty); R=10 (gap extension penalty); wink=1 (generates word hits at every wink^(th) position along the query); and gapw=16 (sets the window width within which gapped alignments are generated). The equivalent Blastp parameter settings were Q=9; R=2; wink=1; and gapw=32. A Gap comparison between sequences, available in the Accelrys' Wisconsin Package version 10.2, uses DNA parameters GAP=50 (gap creation penalty) and LEN=3 (gap extension penalty) and the equivalent settings in protein comparisons are GAP=8 and LEN=2.

[0060] “Stringent conditions” include those that (1) employ low ionic strength and high temperature for washing, for example, 0.015 M NaCl/0.0015 M sodium citrate/0.1% SDS at 50 C, or (2) employ during hybridization a denaturing agent such as formamide, for example, 50% (vol/vol) formamide with 0.1% bovine serum albumin/0.1% Ficoll/0.1% polyvinylpyrrolidone/50 mM sodium phosphate buffer at pH 6.5 with 750 mM NaCl, 75 mM sodium citrate at 42 C. Another example is hybridization in 50% formamide, 5×SSC (0.75 M NaCl, 0.075 M sodium citrate), 50 mM sodium phosphate (pH 6.8), 0.1% sodium pyrophosphate, 5×Denhardt's solution, sonicated salmon sperm DNA (50 g/ml), 0.1% SDS, and 10% dextran sulfate at 42 C, with washes at 42 C in 0.2×SSC and 0.1% SDS. A skilled artisan can readily determine and vary the stringency conditions appropriately to obtain a clear and detectable hybridization signal. Preferred molecules are those that hybridize under the above conditions to the complement of SEQ ID NO: 1, 3, 5, 7, 9, 11, 13, or 15 and which encode a functional protein. Even more preferred hybridizing molecules are those that hybridize under the above conditions to the complement strand of the open reading frame of SEQ ID NO: 1, 3, 5, 7,9, 11, 13, or 15.

[0061] As used herein, a nucleic acid molecule is said to be “isolated” when the nucleic acid molecule is substantially separated from contaminant nucleic acid molecules encoding other polypeptides.

[0062] The present invention further provides fragments of the nucleic acid molecules. As used herein, a fragment of an encoding nucleic acid molecule refers to a small portion of the entire protein coding sequence. The size of the fragment will be determined by the intended use. For example, if the fragment is chosen so as to encode an active portion of the protein, the fragment will need to be large enough to encode the functional region(s) of the protein. For instance, fragments which encode peptides corresponding to predicted antigenic regions may be prepared. If the fragment is to be used as a nucleic acid probe or PCR primer, then the fragment length is chosen so as to obtain a relatively small number of false positives during probing/priming (see the discussion in Section H).

[0063] Fragments of the nucleic acid molecules of the present invention (i.e., synthetic oligonucleotides) that are used as probes or specific primers for the polymerase chain reaction (PCR), or to synthesize gene sequences encoding proteins of the invention, can easily be synthesized by chemical techniques, for example, the phosphotriester method of Matteucci et al., (1981) (J. Am. Chem. Soc. 103, 3185-3191) or using automated synthesis methods. In addition, larger DNA segments can readily be prepared by well-known methods, such as synthesis of a group of oligonucleotides that define various modular segments of the gene, followed by ligation of oligonucleotides to build the complete modified gene.

[0064] The nucleic acid molecules of the present invention may further be modified so as to contain a detectable label for diagnostic and probe purposes. A variety of such labels are known in the art and can readily be employed with the encoding molecules herein described. Suitable labels include, but are not limited to, biotin, radiolabeled nucleotides and the like. A skilled artisan can readily employ any such label to obtain labeled variants of the nucleic acid molecules of the invention.

[0065] Modifications to the primary structure of the nucleic acid molecules by deletion, addition, or alteration of the nucleotide sequence can be made without destroying the activity of the encoded proteins. Such substitutions or other alterations result in proteins having an amino acid sequence falling within the contemplated scope of the present invention.

[0066] C. Isolation of other Related Nucleic Acid Molecules

[0067] As described above, the identification and characterization of the nucleic acid molecule having SEQ ID NO: 1, 3, 5, 7, 9, 11, 13, or 15 allows a skilled artisan to isolate nucleic acid molecules that encode other members of the protein family in addition to the sequences herein described.

[0068] For instance, a skilled artisan can readily use the amino acid sequence of SEQ ID NO: 2, 4, 6, 8, 10, 12, 14 or 16 to generate antibody probes to screen expression libraries prepared from appropriate cells. Typically, polyclonal antiserum from mammals such as rabbits immunized with the purified protein (as described below) or monoclonal antibodies can be used to probe a mammalian cDNA or genomic expression library, such as lambda gt11 library, to obtain the appropriate coding sequence for other members of the protein family. The cloned cDNA sequence can be expressed as a fusion protein, expressed directly using its own control sequences, or expressed by constructions using control sequences appropriate to the particular host used for expression of the enzyme.

[0069] Alternatively, a portion of the coding sequence herein described can be synthesized and used as a probe to retrieve DNA encoding a member of the protein family from any mammalian organism. Oligomers containing approximately 18-20 nucleotides (encoding about a 6-7 amino acid stretch) are prepared and used to screen genomic DNA or cDNA libraries to obtain hybridization under stringent conditions or conditions of sufficient stringency to eliminate an undue level of false positives.

[0070] Additionally, pairs of oligonucleotide primers can be prepared for use in a polymerase chain reaction (PCR) to selectively clone an encoding nucleic acid molecule. A PCR denature/anneal/extend cycle for using such PCR primers is well known in the art and can readily be adapted for use in isolating other encoding nucleic acid molecules.

[0071] Nucleic acid molecules encoding other members of the protein family may also be identified in existing genomic or other sequence information using any available computational method, including but not limited to: PSI-BLAST (Altschul, et al. (1997) Nucleic Acids Res. 25:3389-3402); PHI-BLAST (Zhang, et al. (1998), Nucleic Acids Res. 26:3986-3990), 3D-PSSM (Kelly et al. (2000) J. Mol. Biol. 299(2): 499-520); and other computational analysis methods (Shi et al. (1999) Biochem. Biophys. Res. Commun. 262(1):132-8 and Matsunami et. al (2000) Nature 404(6778):601-4.

[0072] D. rDNA Molecules Containing a Nucleic Acid Molecule

[0073] The present invention further includes recombinant DNA molecules (rDNAs) that contain a coding sequence. As used herein, a rDNA molecule is a DNA molecule that has been subjected to molecular manipulation in situ. Methods for generating rDNA molecules are well known in the art, for example, see Sambrook et al., (1989) Molecular Cloning—A Laboratory Manual, Cold Spring Harbor Laboratory Press. In the preferred rDNA molecules, a coding DNA sequence is operably linked to expression control sequences and/or vector sequences.

[0074] The choice of vector and/or expression control sequences to which one of the protein family encoding sequences of the present invention is operably linked depends directly, as is well known in the art, on the functional properties desired, e.g., protein expression, and the host cell to be transformed. A vector contemplated by the present invention is at least capable of directing the replication or insertion into the host chromosome, and preferably also expression, of the structural gene included in the rDNA molecule.

[0075] Expression control elements that are used for regulating the expression of an operably linked protein encoding sequence are known in the art and include, but are not limited to, inducible promoters, constitutive promoters, secretion signals, and other regulatory elements. Preferably, the inducible promoter is readily controlled, such as being responsive to a nutrient in the host cell's medium.

[0076] In one embodiment, the vector containing a coding nucleic acid molecule will include a prokaryotic replicon, i.e., a DNA sequence having the ability to direct autonomous replication and maintenance of the recombinant DNA molecule extrachromosomally in a prokaryotic host cell, such as a bacterial host cell, transformed therewith. Such replicons are well known in the art. In addition, vectors that include a prokaryotic replicon may also include a gene whose expression confers a detectable marker such as a drug resistance. Typical bacterial drug resistance genes are those that confer resistance to ampicillin or tetracycline.

[0077] Vectors that include a prokaryotic replicon can further include a prokaryotic or bacteriophage promoter capable of directing the expression (transcription and translation) of the coding gene sequences in a bacterial host cell, such as E. coli. A promoter is an expression control element formed by a DNA sequence that permits binding of RNA polymerase and transcription to occur. Promoter sequences compatible with bacterial hosts are typically provided in plasmid vectors containing convenient restriction sites for insertion of a DNA segment of the present invention. Typical of such vector plasmids are pUC8, pUC9, pBR322 and pBR329 available from BioRad Laboratories, (Richmond, Calif.), pPL and pKK223 available from Pharmacia (Piscataway, N.J.).

[0078] Expression vectors compatible with eukaryotic cells, preferably those compatible with vertebrate cells, can also be used to form rDNA molecules that contain a coding sequence. Eukaryotic cell expression vectors, including viral vectors, are well known in the art and are available from several commercial sources. Typically, such vectors are provided containing convenient restriction sites for insertion of the desired DNA segment. Typical of such vectors are pSVL and pKSV-10 (Pharmacia), pBPV-1/pML2d (International Biotechnologies, Inc.), pTDT1 (ATCC, #31255), the vector pCDM8 described herein, and the like eukaryotic expression vectors.

[0079] Eukaryotic cell expression vectors used to construct the rDNA molecules of the present invention may further include a selectable marker that is effective in an eukaryotic cell, preferably a drug resistance selection marker. A preferred drug resistance marker is the gene whose expression results in neomycin resistance, i.e., the neomycin phosphotransferase (neo) gene. (Southern et al., (1982) J. Mol. Anal. Genet. 1, 327-341) Alternatively, the selectable marker can be present on a separate plasmid, and the two vectors are introduced by co-transfection of the host cell, and selected by culturing in the appropriate drug for the selectable marker.

[0080] E. Host Cells Containing an Exogenously Supplied Coding Nucleic Acid Molecule

[0081] The present invention further includes host cells transformed with a nucleic acid molecule that encodes a protein of the present invention. The host cell can be either prokaryotic or eukaryotic. Eukaryotic cells useful for expression of a protein of the invention are not limited, so long as the cell line is compatible with cell culture methods and compatible with the propagation of the expression vector and expression of the gene product. Preferred eukaryotic host cells include, but are not limited to, yeast, insect and mammalian cells, preferably vertebrate cells such as those from a mouse, rat, monkey or human cell line. Preferred eukaryotic host cells include Chinese hamster ovary (CHO) cells available from the ATCC as CCL61, NIH Swiss mouse embryo cells (NIH/3T3) available from the ATCC as CRL 1658, baby hamster kidney cells (BHK), and the like eukaryotic tissue culture cell lines.

[0082] Any prokaryotic host can be used to express a rDNA molecule encoding a protein of the invention. The preferred prokaryotic host is E. coli.

[0083] Transformation of appropriate cell hosts with a rDNA molecule of the present invention is accomplished by well-known methods that typically depend on the type of vector used and host system employed. With regard to transformation of prokaryotic host cells, electroporation and salt treatment methods are typically employed, see, for example, Cohen et al., (1972) Proc. Natl. Acad. Sci. USA 69, 2110; and Sambrook et al., (1989) Molecular Cloning—A Laboratory Manual, Cold Spring Harbor Laboratory Press. With regard to transformation of vertebrate cells with vectors containing rDNAs, electroporation, cationic lipid or salt treatment methods are typically employed, see, for example, Graham et al., (1973) Virol. 52, 456; Wigler et al., (1979) Proc. Natl. Acad. Sci. USA 76, 1373-1376.

[0084] Successfully transformed cells, i.e., cells that contain a rDNA molecule of the present invention, can be identified by well known techniques including the selection for a selectable marker. For example, cells resulting from the introduction of an rDNA of the present invention can be cloned to produce single colonies. Cells from those colonies can be harvested, lysed and their DNA content examined for the presence of the rDNA using a method such as that described by Southern, J. Mol. Biol. 98:503, 1975, or Berent et al., (1985) Biotech. 3, 208 or the proteins produced from the cell assayed via an immunological method.

[0085] F. Production of Recombinant Proteins Using a rDNA Molecule

[0086] The present invention further includes methods for producing a protein of the invention using nucleic acid molecules herein described. In general terms, the production of a recombinant form of a protein typically involves the following steps:

[0087] A nucleic acid molecule is first obtained that encodes a protein of the invention, such as a nucleic acid molecule comprising, consisting essentially of or consisting of SEQ ID NO: 1, or the open reading frame defined by nucleotides 70-1038 (or 1041) of SEQ ID NO: 1; a nucleic acid molecule comprising, consisting essentially of or consisting of SEQ ID NO: 3, or the open reading frame defined by nucleotides 1-3453 (or 3456) of SEQ ID NO: 3; a nucleic acid molecule comprising, consisting essentially of or consisting of SEQ ID NO: 5, or the open reading frame defined by nucleotides 1-3345 (or 3348) of SEQ ID NO: 5; a nucleic acid molecule comprising, consisting essentially of or consisting of SEQ ID NO: 7, or the open reading frame defined by nucleotides 317-4093 (or 4096) of SEQ ID NO: 7; a nucleic acid molecule comprising, consisting essentially of or consisting of SEQ ID NO: 9, or the open reading frame defined by nucleotides 1-1050 (or 1053) of SEQ ID NO: 9; a nucleic acid molecule comprising, consisting essentially of or consisting of SEQ ID NO: 11, or the open reading frame defined by nucleotides 1-1194 (or 1197) of SEQ ID NO: 11; a nucleic acid molecule comprising, consisting essentially of or consisting of SEQ ID NO: 13, or the open reading frame defined by nucleotides 1-321 (or 324) of SEQ ID NO: 13; or a nucleic acid molecule comprising, consisting essentially of or consisting of SEQ ID NO: 15, or the open reading frame defined by nucleotides 9-257 (or 260) of SEQ ID NO: 15. If the encoding sequence is uninterrupted by introns, as are these open reading frame, it is directly suitable for expression in any host.

[0088] The nucleic acid molecule is then preferably placed in operable linkage with suitable control sequences, as described above, to form an expression unit containing the protein open reading frame. The expression unit is used to transform a suitable host and the transformed host is cultured under conditions that allow the production of the recombinant protein. Optionally the recombinant protein is isolated from the medium or from the cells; recovery and purification of the protein may not be necessary in some instances where some impurities may be tolerated.

[0089] Each of the foregoing steps can be done in a variety of ways. For example, the desired coding sequences may be obtained from genomic fragments and used directly in appropriate hosts. The construction of expression vectors that are operable in a variety of hosts is accomplished using appropriate replicons and control sequences, as set forth above. The control sequences, expression vectors, and transformation methods are dependent on the type of host cell used to express the gene and were discussed in detail earlier. Suitable restriction sites can, if not normally available, be added to the ends of the coding sequence so as to provide an excisable gene to insert into these vectors. A skilled artisan can readily adapt any host/expression system known in the art for use with the nucleic acid molecules of the invention to produce recombinant protein.

[0090] G. Methods to Identify Binding Partners

[0091] Another embodiment of the present invention includes methods of isolating and identifying binding partners of proteins of the invention. By recitation of “binding partner,” the present invention includes ligands or receptors which bind specifically to MC21, MC22.1, MC22.2, MC25, MC33, MC36 or MC39. Further, binding partners also include protein domains which associate via at least one Lys residue within the transmembrane domain at position 282, 292, 293, 294, 296, 298, or 301 of SEQ ID NO: 2.

[0092] In general, a protein of the invention is mixed with a potential binding partner or an extract or fraction of a cell under conditions that allow the association of potential binding partners with the protein of the invention. After mixing, peptides, polypeptides, proteins or other molecules that have become associated with a protein of the invention are separated from the mixture. The binding partner that bound to the protein of the invention can then be removed and further analyzed.

[0093] To identify and isolate a binding partner, the entire protein, for instance a protein comprising the entire amino acid sequence of SEQ ID NO: 2 or comprising the sequence of the mature MC21 protein without the signal peptide can be used. For example, a protein or polypeptide consisting of or comprising about amino acid residues 39 to 323 of SEQ ID NO: 2, can be used. Further, a protein or polypeptide consisting of or comprising about amino acid residues 1 to 1151 of SEQ ID NO: 4, 1 to 1115 of SEQ ID NO: 6, 1 to 1259 of SEQ ID NO: 8, 1 to 350 of SEQ ID NO: 10, 1 to 398 of SEQ ID NO: 12, 1 to 107 of SEQ ID NO: 14, or 1 to 83 of SEQ ID NO: 4 can also be used. Alternatively, a fragment of the protein preferably having a consecutive sequence of at least about 25 or 30 amino acid residues, more preferably at least about 35 or 40 amino acid residues, even more preferably at least about 45 or 50 amino acid residues, yet more preferably at least about 55 or 60, still more preferably at least about 65 or 70 amino acid residues and most preferably at least about 75 or more amino acid residues can be used. Further, a fragment of the protein comprising at least the extracellular domain of MC21 or comprising at least the extracellular and transmembrane domains of MC21 can be used to identify and isolate a binding partner. For example, a fragment consisting of or comprising about amino acid residues 39-278 or 39-301 of SEQ ID NO: 2 can be used to identify and isolate a binding partner.

[0094] As used herein, a cellular extract refers to a preparation or fraction which is made from a lysed or disrupted cell. The preferred source of cellular extracts will be cells derived from human skin tissue or the human respiratory tract or cells derived from a biopsy sample of human lung tissue in patients with allergic hypersensitivity. Alternatively, cellular extracts may be prepared from normal tissue or available cell lines, particularly granulocytic cell lines.

[0095] A variety of methods can be used to obtain an extract of a cell. Cells can be disrupted using either physical or chemical disruption methods. Examples of physical disruption methods include, but are not limited to, sonication and mechanical shearing. Examples of chemical lysis methods include, but are not limited to, detergent lysis and enzyme lysis. A skilled artisan can readily adapt methods for preparing cellular extracts in order to obtain extracts for use in the present methods.

[0096] Once an extract of a cell is prepared, the extract is mixed with the protein of the invention under conditions in which association of the protein with the binding partner can occur. A variety of conditions can be used, the most preferred being conditions that closely resemble conditions found in the cytoplasm of a human cell. Features such as osmolarity, pH, temperature, and the concentration of cellular extract used, can be varied to optimize the association of the protein with the binding partner.

[0097] After mixing under appropriate conditions, the bound complex is separated from the mixture. A variety of techniques can be utilized to separate the mixture. For example, antibodies specific to a protein of the invention can be used to immunoprecipitate the binding partner complex. Alternatively, standard chemical separation techniques such as chromatography and density/sediment centrifugation can be used.

[0098] After removal of non-associated cellular constituents found in the extract, the binding partner can be dissociated from the complex using conventional methods. For example, dissociation can be accomplished by altering the salt concentration or pH of the mixture.

[0099] To aid in separating associated binding partner pairs from the mixed extract, the protein of the invention can be immobilized on a solid support. For example, the protein can be attached to a nitrocellulose matrix or acrylic beads. Attachment of the protein to a solid support aids in separating peptide/binding partner pairs from other constituents found in the extract. The identified binding partners can be either a single protein or a complex made up of two or more proteins. Alternatively, binding partners may be identified using a Far-Western assay according to the procedures of Takayama et al., (1997) Methods Mol. Biol. 69:171-184 or Sauder et al., (1996) J. Gen. Virol. 77:991-996 or identified through the use of epitope tagged proteins or GST fusion proteins.

[0100] Alternatively, the nucleic acid molecules of the invention can be used in a yeast two-hybrid system. The yeast two-hybrid system has been used to identify other protein partner pairs and can readily be adapted to employ the nucleic acid molecules herein described.

[0101] H. Methods to Identify Agents that Modulate the Expression of a Nucleic Acid Encoding the Gene Associated with Mast Cell Degranulation or Allergic Hypersensitivity

[0102] Another embodiment of the present invention includes methods for identifying agents that modulate the expression of a nucleic acid encoding a protein of the invention such as a protein having the amino acid sequence of SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, or 16. Such assays may utilize any available means of monitoring for changes in the expression level of the nucleic acids of the invention. As used herein, an agent is said to modulate the expression of a nucleic acid of the invention if it is capable of up- or down-regulating expression of the nucleic acid in a cell.

[0103] In one assay format, cell lines that contain reporter gene fusions between the open reading frame defined by nucleotides 70-1041 of SEQ ID NO: 1, and/or the 5′ and/or 3′ regulatory elements and any assayable fusion partner may be prepared. Alternatively, in the same, a separate or a related assay format, cell lines that contain reporter gene fusions between the open reading frame defined by nucleotides 1-3453 of SEQ ID NO: 3, nucleotides 1-3345 of SEQ ID NO: 5, nucleotides 317-4093 of SEQ ID NO: 7, nucleotides 1-1050 of SEQ ID NO: 9, nucleotides 1-1194 of SEQ ID NO: 11, nucleotides 1-321 of SEQ ID NO: 13, or nucleotides 9-257 of SEQ ID NO: 15, and/or the 5′ and/or 3′ regulatory elements and any assayable fusion partner may be prepared. Numerous assayable fusion partners are known and readily available including the firefly luciferase gene and the gene encoding chloramphenicol acetyltransferase (Alam et al., (1990) Anal. Biochem. 188, 245-254). Cell lines containing the reporter gene fusions are then exposed to the agent to be tested under appropriate conditions and time. Differential expression of the reporter gene between samples exposed to the agent and control samples identifies agents which modulate the expression of a nucleic acid of the invention.

[0104] Additional assay formats may be used to monitor the ability of the agent to modulate the expression of a nucleic acid encoding a protein of the invention, such as the protein having SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, or 16. For instance, mRNA expression may be monitored directly by hybridization to the nucleic acids of the invention. Cell lines are exposed to the agent to be tested under appropriate conditions and time and total RNA or mRNA is isolated by standard procedures such those disclosed in Sambrook et al., (1989) Molecular Cloning—A Laboratory Manual, Cold Spring Harbor Laboratory Press).

[0105] Probes to detect differences in RNA expression levels between cells exposed to the agent and control cells may be prepared from the nucleic acids of the invention. It is preferable, but not necessary, to design probes which hybridize only with target nucleic acids under conditions of high stringency. Only highly complementary nucleic acid hybrids form under conditions of high stringency. Accordingly, the stringency of the assay conditions determines the amount of complementarity which should exist between two nucleic acid strands in order to form a hybrid. Stringency should be chosen to maximize the difference in stability between the probe:target hybrid and probe:non-target hybrids.

[0106] Probes may be designed from the nucleic acids of the invention through methods known in the art. For instance, the G+C content of the probe and the probe length can affect probe binding to its target sequence. Methods to optimize probe specificity are commonly available in Sambrook et al., (1989) Molecular Cloning—A Laboratory Manual, Cold Spring Harbor Laboratory Press) or Ausubel et al., (1995) Current Protocols in Molecular Biology, Greene Publishing Co.

[0107] Hybridization conditions are modified using known methods, such as those described by Sambrook et al. and Ausubel et al. as required for each probe. Hybridization of total cellular RNA or RNA enriched for polyA RNA can be accomplished in any available format. For instance, total cellular RNA or RNA enriched for polyA RNA can be affixed to a solid support and the solid support exposed to at least one probe comprising at least one, or part of one of the sequences of the invention under conditions in which the probe will specifically hybridize. Alternatively, nucleic acid fragments comprising at least one, or part of one of the sequences of the invention can be affixed to a solid support, such as a silicon chip or a porous glass wafer. The wafer can then be exposed to total cellular RNA or polyA RNA from a sample under conditions in which the affixed sequences will specifically hybridize. Such solid supports and hybridization methods are widely available, for example, those disclosed by Beattie, (1995) WO 95/11755. By examining for the ability of a given probe to specifically hybridize to an RNA sample from an untreated cell population and from a cell population exposed to the agent, agents which up or down regulate the expression of a nucleic acid encoding the protein having the sequence of SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, or 16 are identified.

[0108] Hybridization for qualitative and quantitative analysis of mRNAs may also be carried out by using a RNase Protection Assay (i.e., RPA, see Ma et al. (1996) Methods 10, 273-238). Briefly, an expression vehicle comprising cDNA encoding the gene product and a phage specific DNA dependent RNA polymerase promoter (e.g., T7, T3 or SP6 RNA polymerase) is linearized at the 3′ end of the cDNA molecule, downstream from the phage promoter, wherein such a linearized molecule is subsequently used as a template for synthesis of a labeled antisense transcript of the cDNA by in vitro transcription. The labeled transcript is then hybridized to a mixture of isolated RNA (i.e., total or fractionated mRNA) by incubation at 45 C overnight in a buffer comprising 80% formamide, 40 mM Pipes, pH 6.4, 0.4 M NaCl and 1 mM EDTA. The resulting hybrids are then digested in a buffer comprising 40 g/ml ribonuclease A and 2 g/ml ribonuclease. After deactivation and extraction of extraneous proteins, the samples are loaded onto urea/polyacrylamide gels for analysis.

[0109] In another assay format, cells or cell lines are first identified which express the gene products of the invention physiologically. Cell and/or cell lines so identified would be expected to comprise the necessary cellular machinery such that the fidelity of modulation of the transcriptional apparatus is maintained with regard to exogenous contact of agent with appropriate surface transduction mechanisms and/or the cytosolic cascades. Further, such cells or cell lines would be transduced or transfected with an expression vehicle (e.g., a plasmid or viral vector) construct comprising an operable non-translated 5′-promoter containing end of the structural gene encoding the instant gene products fused to one or more antigenic fragments, which are peculiar to the instant gene products, wherein said fragments are under the transcriptional control of said promoter and are expressed as polypeptides whose molecular weight can be distinguished from the naturally occurring polypeptides or may further comprise an immunologically distinct tag or other detectable marker. Such a process is well known in the art (see Sambrook et al., (1989)).

[0110] Cells or cell lines transduced or transfected as outlined above are then contacted with agents under appropriate conditions; for example, the agent in a pharmaceutically acceptable excipient is contacted with cells in an aqueous physiological buffer such as phosphate buffered saline (PBS) at physiological pH, Eagles balanced salt solution (BSS) at physiological pH, PBS or BSS comprising serum or conditioned media comprising PBS or BSS and/or serum incubated at 37 C. Said conditions may be modulated as deemed necessary by one of skill in the art. Subsequent to contacting the cells with the agent, said cells will be disrupted and the polypeptides of the lysate are fractionated such that a polypeptide fraction is pooled and contacted with an antibody to be further processed by immunological assay (e.g., ELISA, immunoprecipitation or Western blot). The pool of proteins isolated from the “agent-contacted” sample will be compared with a control sample where only the excipient is contacted with the cells and an increase or decrease in the immunologically generated signal from the “agent-contacted” sample compared to the control will be used to distinguish the effectiveness of the agent.

[0111] I. Methods to Identify Agents that Modulate the Level of or at Least One Activity of the Protein Associated with Mast Cell Degranulation and/or Allergic Hypersensitivity

[0112] Another embodiment of the present invention includes methods for identifying agents that modulate at least one activity of a protein of the invention such as the protein having the amino acid sequence of SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, or 16. Such methods or assays may utilize any means of monitoring or detecting the desired activity.

[0113] In one format, the relative amounts of a protein of the invention between a cell population that has been exposed to the agent to be tested compared to an un-exposed control cell population may be assayed. In this format, probes such as specific antibodies are used to monitor the differential expression of the protein in the different cell populations. Cell lines or populations are exposed to the agent to be tested under appropriate conditions and time. Cellular lysates may be prepared from the exposed cell line or population and a control, unexposed cell line or population. The cellular lysates are then analyzed with the probe.

[0114] Antibody probes are prepared by immunizing suitable mammalian hosts in appropriate immunization protocols using the peptides, polypeptides or proteins of the invention if they are of sufficient length, or, if desired, or if required to enhance immunogenicity, conjugated to suitable carriers. Methods for preparing immunogenic conjugates with carriers such as BSA, KLH, or other carrier proteins are well known in the art. In some circumstances, direct conjugation using, for example, carbodiimide reagents may be effective; in other instances linking reagents such as those supplied by Pierce Chemical Co. (Rockford, Ill.) may be desirable to provide accessibility to the hapten. The hapten peptides can be extended at either the amino or carboxy terminus with a cysteine residue or interspersed with cysteine residues, for example, to facilitate linking to a carrier. Administration of the immunogens is conducted generally by injection over a suitable time period and with use of suitable adjuvants, as is generally understood in the art. During the immunization schedule, titers of antibodies are taken to determine adequacy of antibody formation.

[0115] While the polyclonal antisera produced in this way may be satisfactory for some applications, for pharmaceutical compositions, use of monoclonal preparations is preferred. Immortalized cell lines which secrete the desired monoclonal antibodies may be prepared using the standard method of Kohler and Milstein (Nature (1975) 256:495-497) or modifications which effect immortalization of lymphocytes or spleen cells, as is generally known. The immortalized cell lines secreting the desired antibodies are screened by immunoassay in which the antigen is the peptide hapten, polypeptide or protein. When the appropriate immortalized cell culture secreting the desired antibody is identified, the cells can be cultured either in vitro or by production in ascites fluid.

[0116] The desired monoclonal antibodies are then recovered from the culture supernatant or from the ascites supernatant. Fragments of the monoclonal antibodies or the polyclonal antisera which contain the immunologically significant portion can be used as antagonists, as well as the intact antibodies. Use of immunologically reactive antibody fragments, such as the Fab, Fab′, of F(ab′)₂ fragments is often preferable, especially in a therapeutic context, as these fragments are generally less immunogenic than the whole immunoglobulin.

[0117] The antibodies or fragments may also be produced, using current technology, by recombinant means. Antibody regions that bind specifically to the desired regions of the protein can also be produced in the context of chimeras with multiple species origin, such as humanized antibodies. The antibodies may be used in any of the methods described herein, may be used as diagnostic agents or may be used as therapeutic agents.

[0118] Agents that are assayed in the above method can be randomly selected or rationally selected or designed. As used herein, an agent is said to be randomly selected when the agent is chosen randomly without considering the specific sequences involved in the association of a protein of the invention alone or with its associated substrates, binding partners, etc. An example of randomly selected agents is the use a chemical library or a peptide combinatorial library, or a growth broth of an organism.

[0119] As used herein, an agent is said to be rationally selected or designed when the agent is chosen on a nonrandom basis which takes into account the sequence of the target site and/or its conformation in connection with the agents action. Agents can be rationally selected or rationally designed by utilizing the peptide sequences that make up these sites. For example, a rationally selected peptide agent can be a peptide whose amino acid sequence is identical to or a derivative of any functional consensus site.

[0120] The agents of the present invention can be, as examples, peptides, small molecules, vitamin derivatives, as well as carbohydrates. Dominant negative proteins, DNAs encoding these proteins, antibodies to these proteins, peptide fragments of these proteins or mimics of these proteins may be introduced into cells to affect function. “Mimic” used herein refers to the modification of a region or several regions of a peptide molecule to provide a structure chemically different from the parent peptide but topographically and functionally similar to the parent peptide (see Grant GA. in: Meyers (ed.) Molecular Biology and Biotechnology (New York, VCH Publishers, 1995), pp. 659-664). A skilled artisan can readily recognize that there is no limit as to the structural nature of the agents of the present invention.

[0121] The peptide agents of the invention can be prepared using standard solid phase (or solution phase) peptide synthesis methods, as is known in the art. In addition, the DNA encoding these peptides may be synthesized using commercially available oligonucleotide synthesis instrumentation and produced recombinantly using standard recombinant production systems. The production using solid phase peptide synthesis is necessitated if non-gene-encoded amino acids are to be included.

[0122] Another class of agents of the present invention are antibodies immunoreactive with critical positions of proteins of the invention. Antibody agents are obtained by immunization of suitable mammalian subjects with peptides, containing as antigenic regions, those portions of the protein intended to be targeted by the antibodies as described above.

[0123] J. Uses for Agents that Modulate at Least One Activity of the Protein.

[0124] As provided in the Examples, the proteins and nucleic acids of the invention, such as the protein having the amino acid sequence of SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, or 16, are differentially expressed during mast cell degranulation and/or in allergic hypersensitivity disease states. Agents that modulate or up-or-down-regulate the expression of the protein or agents, such as agonists or antagonists of at least one activity of the protein, may be used to modulate biological and pathologic processes associated with the protein's function and activity.

[0125] As used herein, a subject can be any mammal, so long as the mammal is in need of modulation of a pathological or biological process mediated by a protein of the invention. The term mammal is defined as an individual belonging to the class Mammalia. The invention is particularly useful in the treatment of human subjects.

[0126] Pathological processes refer to a category of biological processes which produce a deleterious effect. For example, expression of a protein of the invention may be associated with allergic hypersensitivity. As used herein, an agent is said to modulate a pathological process when the agent reduces the degree or severity of the process. For instance, allergic hypersensitivity may be prevented or disease progression modulated by the administration of agents which up- or down-regulate or modulate in some way the expression or at least one activity of a protein of the invention.

[0127] The agents of the present invention can be provided alone, or in combination with other agents that modulate a particular pathological process. For example, an agent of the present invention can be administered in combination with other known drugs. As used herein, two agents are said to be administered in combination when the two agents are administered simultaneously or are administered independently in a fashion such that the agents will act at the same time.

[0128] The agents of the present invention can be administered via parenteral, subcutaneous, intravenous, intramuscular, intraperitoneal, transdermal, or buccal routes. Alternatively, or concurrently, administration may be by the oral route. The dosage administered will be dependent upon the age, health, and weight of the recipient, kind of concurrent treatment, if any, frequency of treatment, and the nature of the effect desired.

[0129] The present invention further provides compositions containing one or more agents which modulate expression or at least one activity of a protein of the invention. While individual needs vary, determination of optimal ranges of effective amounts of each component is within the skill of the art. Typical dosages comprise 0.1 to 100 g/kg body wt. The preferred dosages comprise 0.1 to 10 g/kg body wt. The most preferred dosages comprise 0.1 to 1 g/kg body wt.

[0130] In addition to the pharmacologically active agent, the compositions of the present invention may contain suitable pharmaceutically acceptable carriers comprising excipients and auxiliaries which facilitate processing of the active compounds into preparations which can be used pharmaceutically for delivery to the site of action. Suitable formulations for parenteral administration include aqueous solutions of the active compounds in water-soluble form, for example, water-soluble salts. In addition, suspensions of the active compounds as appropriate oily injection suspensions may be administered. Suitable lipophilic solvents or vehicles include fatty oils, for example, sesame oil, or synthetic fatty acid esters, for example, ethyl oleate or triglycerides. Aqueous injection suspensions may contain substances which increase the viscosity of the suspension include, for example, sodium carboxymethyl cellulose, sorbitol, and/or dextran. Optionally, the suspension may also contain stabilizers. Liposomes can also be used to encapsulate the agent for delivery into the cell.

[0131] The pharmaceutical formulation for systemic administration according to the invention may be formulated for enteral, parenteral or topical administration. Indeed, all three types of formulations may be used simultaneously to achieve systemic administration of the active ingredient.

[0132] Suitable formulations for oral administration include hard or soft gelatin capsules, pills, tablets, including coated tablets, elixirs, suspensions, syrups or inhalations and controlled release forms thereof.

[0133] In practicing the methods of this invention, the compounds of this invention may be used alone or in combination, or in combination with other therapeutic or diagnostic agents. In certain preferred embodiments, the compounds of this invention may be coadministered along with other compounds typically prescribed for these conditions according to generally accepted medical practice such as antihistamines. The compounds of this invention can be utilized in vivo, ordinarily in mammals, such as humans, sheep, horses, cattle, pigs, dogs, cats, rats and mice, or in vitro.

[0134] K. Transgenic Animals

[0135] Transgenic animals containing mutant, knock-out or modified genes corresponding to the cDNA sequence of SEQ ID NO: 1, 3, 5, 7, 9, 11, 13, or 15, or the open reading frame encoding the polypeptide sequence of SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, or 16, or fragments thereof preferably having a consecutive sequence of at least about 25 or 30 amino acid residues, more preferably 35 or 40 amino acid residues, even more preferably 45 or 50 amino acid residues, yet more preferably 55 or 60, still more preferably 65 or 70 amino acid residues and most preferably at least 75 or more amino acid residues, are also included in the invention. Transgenic animals are genetically modified animals into which recombinant, exogenous or cloned genetic material has been experimentally transferred. Such genetic material is often referred to as a “transgene.” The nucleic acid sequence of the transgene, in this case a form of SEQ ID NO: 1, 3, 5, 7, 9, 11, 13, or 15 may be integrated either at a locus of a genome where that particular nucleic acid sequence is not otherwise normally found or at the normal locus for the transgene. The transgene may consist of nucleic acid sequences derived from the genome of the same species or of a different species than the species of the target animal.

[0136] The term “germ cell line transgenic animal” refers to a transgenic animal in which the genetic alteration or genetic information was introduced into a germ line cell, thereby conferring the ability of the transgenic animal to transfer the genetic information to offspring. If such offspring in fact possess some or all of that alteration or genetic information, then they too are transgenic animals.

[0137] The alteration or genetic information may be foreign to the species of animal to which the recipient belongs, foreign only to the particular individual recipient, or may be genetic information already possessed by the recipient. In the last case, the altered or introduced gene may be expressed differently than the native gene.

[0138] Transgenic animals can be produced by a variety of different methods including transfection, electroporation, microinjection, gene targeting in embryonic stem cells and recombinant viral and retroviral infection (see, e.g., U.S. Pat. No. 4,736,866; U.S. Pat. No. 5,602,307; Mullins et al., (1993) Hypertension 22, 630-633; Brenin et al., (1997) Surg. Oncol. 6, 99-110; Tuan (1997) Recombinant Gene Expression Protocols, Methods in Molecular Biology, Humana Press).

[0139] A number of recombinant or transgenic mice have been produced, including those which express an activated oncogene sequence (U.S. Pat. No. 4,736,866); express simian SV40 T-antigen (U.S. Pat. No. 5,728,915); lack the expression of interferon regulatory factor 1 (IRF-1) (U.S. Pat. No. 5,731,490); exhibit dopaminergic dysfunction (U.S. Pat. No. 5,723,719); express at least one human gene which participates in blood pressure control (U.S. Pat. No. 5,731,489); display greater similarity to the conditions existing in naturally occurring Alzheimer's disease (U.S. Pat. No. 5,720,936); have a reduced capacity to mediate cellular adhesion (U.S. Pat. No. 5,602,307); possess a bovine growth hormone gene (Clutter et al., (1996) Genetics 143, 1753-1760); or, are capable of generating a fully human antibody response (McCarthy (1997) Lancet 349, 405).

[0140] While mice and rats remain the animals of choice for most transgenic experimentation, in some instances it is preferable or even necessary to use alternative animal species. Transgenic procedures have been successfully utilized in a variety of non-murine animals, including sheep, goats, pigs, dogs, cats, monkeys, chimpanzees, hamsters, rabbits, cows and guinea pigs (see, e.g., Kim et al., (1997) Mol. Reprod. Dev. 46, 515-526; Houdebine (1995) Reprod. Nutr. Dev. 35, 609-617; Petters (1994) Reprod. Fertil. Dev. 6, 643-645; Schnieke et al., (1997) Science 278, 2130-2133; and Amoah (1997) J. Animal Science 75, 578-585).

[0141] The method of introduction of nucleic acid fragments into recombination competent mammalian cells can be by any method which favors co-transformation of multiple nucleic acid molecules. Detailed procedures for producing transgenic animals are readily available to one skilled in the art, including the disclosures in U.S. Pat. No. 5,489,743 and U.S. Pat. No. 5,602,307.

[0142] L. Diagnostic Methods and Agents

[0143] The genes and proteins of the invention may be used to diagnose or monitor allergic hypersensitivity diseases, seasonal rhinitis, asthma, atopic dermatitis and amstocytosis or to track disease progression. One means of diagnosing allergic hypersensitivity using the nucleic acid molecules or proteins of the invention involves obtaining cells from skin, lung or respiratory tract tissue from living subjects. When possible, mucosal secretions, urine, blood or peripheral lymphocyte samples may be used as the tissue sample in the assay.

[0144] The use of molecular biological tools has become routine in forensic technology. For example, nucleic acid probes may be used to determine the expression of a nucleic acid molecule comprising all or at least part of the sequences of SEQ ID NO: 1, 3, 5, 7, 9, 11, 13, or 15 in forensic/pathology specimens. Further, nucleic acid assays may be carried out by any means of conducting a transcriptional profiling analysis. In addition to nucleic acid analysis, forensic methods of the invention may target the proteins of the invention, particularly a protein comprising SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, or 16, to determine up or down regulation of the genes (Shiverick et al., (1975) Biochim Biophys Acta 393, 124-133).

[0145] Methods of the invention may involve treatment of tissues with collagenases or other proteases to make the tissue amenable to cell lysis (Semenov et al., (1987) Biull Eksp Biol Med 104, 113-116). Further, it is possible to obtain biopsy samples from different regions of the skin, respiratory tract or lungs for analysis.

[0146] Assays to detect nucleic acid or protein molecules of the invention may be in any available format. Typical assays for nucleic acid molecules include hybridization or PCR based formats. Typical assays for the detection of proteins, polypeptides or peptides of the invention include the use of antibody probes in any available format such as in situ binding assays, etc. See Harlow & Lane, (1988) Antibodies—A Laboratory Manual, Cold Spring Harbor Laboratory Press. In preferred embodiments, assays are carried-out with appropriate controls.

[0147] The above methods may also be used in other diagnostic protocols, including protocols and methods to detect disease states in other tissues or organs, for example the tissues in which gene expression is detected.

[0148] Without further description, it is believed that one of ordinary skill in the art can, using the preceding description and the following illustrative examples, make and utilize the compounds of the present invention and practice the claimed methods. The following working examples therefore, specifically point out preferred embodiments of the present invention, and are not to be construed as limiting in any way the remainder of the disclosure.

EXAMPLES Example 1

[0149] Identification of the Differentially Expressed MC21 mRNA in Mast Cells

[0150] To identify genes which are differentially expressed or regulated in mast cells, mRNA was isolated from cultured human mast cells, isolated hematopoietic cells, and normal human tissues and used in an electronic Northern assay to probe a series of target sequences from the human genome. The intensity of expression of AF150143 (MC21; SEQ ID NO: 1) was measured across a panel of tissues and cell cultures using the Affymetrix human GeneChip 95K chip set. The mean intensity of expression is shown in FIG. 1. The number of data points included in each mean value is given in parentheses to the right of the tissue name. Mast cell samples were cultured for 6 weeks and then harvested or treated as specified. Rest=untreated. Act=3 hours after activation by IgE and antigen. Dex=24 hours after treatement by 10⁻⁷ M dexamethasone. Il-4=5 days after treatment with 10 ng/ml IL-4. T cells stim=stimulation by antibodies to CD3 and CD28.

Example 2

[0151] Cloning of a Full Length Human cDNA Corresponding: to the Differentially Expressed mRNA species

[0152] The full-length cDNA having SEQ ID NO: 1 was obtained by the solution hybridization method. Briefly, a gene-specific oligo was designed based on the sequence of the EST fragment identified in Example 1. The oligo was labeled with biotin and used to hybridize with 5 μg of single strand plasmid DNA (cDNA recombinants) from a human resting mast cell library following the procedures from the GeneTrapper® kit obtained from Invitrogen. The hybridized cDNAs were separated by streptavidin-conjugated beads and eluted by TE buffer. The eluted cDNA was converted to double strand plasmid DNA and used to transform E. Coli cells (DH 5).

[0153] The nucleotide sequences of the full-length human cDNA corresponding to the differentially regulated mRNAs detected above is set forth in SEQ ID NO: 1. The cDNA comprises 1596 base pairs, with a open reading frame at nucleotides 70-1038 (nucleotides 70-1041 with the TAA stop codon) encoding a protein of 323 amino acids. The amino acid sequence corresponding to this encoded protein is set forth in SEQ ID NO: 2. FIG. 2 shows a Northern blot using MC21 cDNA clone BD9 as probe, in which the expression level of a 1.79 kb mRNA species corresponding to SEQ ID NO: 1 was measured in human cultured mast cells or tissues. Lane 1) activated mast cells, Lane 2) resting mast cells, Lane 3) kidney, Lane 4) fetal liver, Lane 5) heart, Lane 6) placenta. A probe based on SEQ ID NO: 1 (randomly labeled, using the Stratagene PRIME-IT II kit) was exposed to human mRNA blots (ClonTech mRNA blot-H4) using sensitive hybridization conditions (Church-Gilbert buffer, 29 hours at room temperature). Significant expression of a 1.79 Kb species was seen in the resting mast cell sample and lower levels were seen in the activated mast cells.

Example 3

[0154] Detection of Allergic Hypersensitivity in a Patient

[0155] The expression level of a nucleic acid or protein of the invention is determined in a sample from a patient suspected of allergic hypersensitivity before and/or after exposure to a potential antigen. The sample may be from an epithelial tissue such as a skin, respiratory tract or lung cell samples, or in urine or in blood samples. The sample may also be bronchoalveloar wash. Tissue or isolated mast cell samples from a patient known to have allergic hypersensitivity and from normal subjects may be used as positive and negative controls, respectively. A change in the level of expression of the nucleic acid or protein of the invention compared to the expression level in control or normal tissue may be indicative of allergic hypersensitivity.

Example 4

[0156] Hydrophobicity and Antigenicity of MC21

[0157] The translation of the open reading frame of cDNA clone OB4, residues 70-1038 of SEQ ID NO: 2, which corresponds to the predicted full-length protein sequence of MC21 (SEQ ID NO: 2) was analyzed by the Kile-Doolittle method for hydrophobic regions within the protein (FIG. 3). The sequence was further analyzed using the Parker method for the presence of particular antigenic regions or epitopes (FIG. 4).

Example 5

[0158] Identification of Additional Differentially Expressed mRNAs in Mast Cells

[0159] Several additional genes were identified which were differentially expressed in mast cells versus other human cell types and tissues. In an electronic Northern assay, the Affymetrix human GeneChip 95K chip set was probed using mRNA samples from cultured human mast cells, isolated hematopoietic cells, and normal human tissues. cDNAs were obtained by multiple methods. CDNAs for MC25 and MC39 were obtained by solution hybridization as in Example 2. cDNAs for MC22, MC33, and MC36 were obtained by 5′ RACE from the est sequence to determine the 5′ end. This sequence as well as human genomic sequence information was used to design primers to PCR the cDNA from mast cell RNA.

[0160]FIG. 5 shows the intensity of expression of MC22. SEQ ID NOs: 3 and 5 represent splice-variants of the full-length human cDNA corresponding to the differentially expressed mRNAs detected, differing from one another by the insert at nucleotides 2147-2254 of SEQ ID NO: 3. FIG. 6 shows the intensity of expression of MC25 (SEQ ID NO: 7), FIG. 7 shows the intensity of expression of MC33 (SEQ ID NO: 9), and FIG. 8 shows the intensity of expression of MC36 (SEQ ID NO: 11). FIG. 9 shows the intensity of expression of MC39, which is represented by the variants of SEQ ID NOs: 13 and 15).

[0161] For each figure, the mean intensity of expression, measured using the Affymetrix human GeneChip 95K chip set, is shown. The number of data points included in each mean value is given in parentheses to the right of the tissue name. Mast cell samples were cultured for 6 weeks and then harvested or treated as specified. Rest=untreated. Act=3 hours after activation by IgE and antigen. Dex=24 hours after treatment by 10-7 M dexamethasone. Il-4=5 days after treatment with 10 ng/ml IL-4. T cells stim=stimulation by antibodies to CD3 and CD28.

[0162] A summary of the genes identified here as being differentially expressed and/or regulated in human mast cells is presented in the Table. Mast Cell DNA Protein Gene Seq Seq Affymetrix Affymetrix Name ID NO: ID NO: Identifier CHIP/ID Description MC21 1  2 AF150143 U95/78123 siglec-like receptor MC22 3, 5 4, 6 AI911742 U95/71487 homologous to rat Tomosyn MC25 7  8 AI631846 U95/75150 kelch domain MC33 9 10 AI733048 U95/90676 homologous to V ATPase D subunit MC36 11  12 AL117442 U95/49047 acyl transferase, Ca binding MC39 13, 15 14, 16 AA932726 U95/83815 poly proline

[0163] Although the present invention has been described in detail with reference to examples above, it is understood that various modifications can be made without departing from the spirit of the invention. Accordingly, the invention is limited only by the following claims. All cited patents, patent applications and publications referred to in this application are herein incorporated by reference in their entirety.

1 16 1 1596 DNA Homo sapiens CDS (70)..(1041) 1 gtgcagaggc ggatgacctc tgaccacgtg tgggtttgag gttcctcttc tgtgaactgc 60 caagatctc atg ctc ctc ccc aca gcc ctc ttc tct gct cac aca gga agc 111 Met Leu Leu Pro Thr Ala Leu Phe Ser Ala His Thr Gly Ser 1 5 10 cca gga agc ctc tgc ctc aga gat gct gcc gct gct gct gcc gct gcc 159 Pro Gly Ser Leu Cys Leu Arg Asp Ala Ala Ala Ala Ala Ala Ala Ala 15 20 25 30 cct gct gtg ggc agg gcc ctc gct cag gat gca aga ttc cgg ctg gag 207 Pro Ala Val Gly Arg Ala Leu Ala Gln Asp Ala Arg Phe Arg Leu Glu 35 40 45 atg cca gag tcc gtg acg gtg cag gag ggt ctg tgc atc ttt gtg cac 255 Met Pro Glu Ser Val Thr Val Gln Glu Gly Leu Cys Ile Phe Val His 50 55 60 tgt tcg gtc ttc tac ctc gag tat ggc tgg aaa gat tct acc cct gct 303 Cys Ser Val Phe Tyr Leu Glu Tyr Gly Trp Lys Asp Ser Thr Pro Ala 65 70 75 tat ggc cac tgg ttc cgg gaa ggg gtc agt gta gac cag gag act cca 351 Tyr Gly His Trp Phe Arg Glu Gly Val Ser Val Asp Gln Glu Thr Pro 80 85 90 gtg gcc aca aac aac tca act caa aaa gtg cag aag gag acc cag ggc 399 Val Ala Thr Asn Asn Ser Thr Gln Lys Val Gln Lys Glu Thr Gln Gly 95 100 105 110 cga ttc cac ctc ctc ggt gat ccc tca agg aac aac tgc tcc ctg agc 447 Arg Phe His Leu Leu Gly Asp Pro Ser Arg Asn Asn Cys Ser Leu Ser 115 120 125 atc aga gac gcc agg agg agg gac aac ggt tca tac ttc ttt tgg gtg 495 Ile Arg Asp Ala Arg Arg Arg Asp Asn Gly Ser Tyr Phe Phe Trp Val 130 135 140 gcg aga gga aga aca aaa ttt agt tac aaa tat tcc ccg ctc tct gtg 543 Ala Arg Gly Arg Thr Lys Phe Ser Tyr Lys Tyr Ser Pro Leu Ser Val 145 150 155 tat gtg aca gcc ctg acc cac agg ccc gac atc ctc atc ccg gag ttc 591 Tyr Val Thr Ala Leu Thr His Arg Pro Asp Ile Leu Ile Pro Glu Phe 160 165 170 cta aag tct ggc cat ccc agc aac ctg acc tgc tct gtg ccc tgg gtc 639 Leu Lys Ser Gly His Pro Ser Asn Leu Thr Cys Ser Val Pro Trp Val 175 180 185 190 tgt gag cag gga aca ccc ccc atc ttc tcc tgg atg tca gct gcc ccc 687 Cys Glu Gln Gly Thr Pro Pro Ile Phe Ser Trp Met Ser Ala Ala Pro 195 200 205 acc tcc ctg ggc ccc agg acc ctc cac tcc tca gtg ctc acg atc atc 735 Thr Ser Leu Gly Pro Arg Thr Leu His Ser Ser Val Leu Thr Ile Ile 210 215 220 cca cgg cct cag gac cac ggc acc aac ctc atc tgt cag gtg acg ttc 783 Pro Arg Pro Gln Asp His Gly Thr Asn Leu Ile Cys Gln Val Thr Phe 225 230 235 ccc gga gct ggt gtg acc acg gag aga acc atc cag ctc agt gtc tcc 831 Pro Gly Ala Gly Val Thr Thr Glu Arg Thr Ile Gln Leu Ser Val Ser 240 245 250 tat cct cta atg aac cca gtg att gaa gcc tcc cta gaa gac ggc aca 879 Tyr Pro Leu Met Asn Pro Val Ile Glu Ala Ser Leu Glu Asp Gly Thr 255 260 265 270 ggg aaa tca gga acc gtg gaa gag gtg gtt gtt ttg gcc gtg ggg gta 927 Gly Lys Ser Gly Thr Val Glu Glu Val Val Val Leu Ala Val Gly Val 275 280 285 gtg gct gtg aag atc ctg ctt ctc tgc ctt tgc ctc atc atc ctc agt 975 Val Ala Val Lys Ile Leu Leu Leu Cys Leu Cys Leu Ile Ile Leu Ser 290 295 300 ttc cac aag aag aag gcg gtg agg gca gtg gag gtt gag gag aat gta 1023 Phe His Lys Lys Lys Ala Val Arg Ala Val Glu Val Glu Glu Asn Val 305 310 315 tat gct gtc atg ggt taa tctctcaggc ctccagactg tacttccaga 1071 Tyr Ala Val Met Gly 320 tgtctcctca tccagttcct ccacagtctg aatggccatg tttcttcttc attgctggag 1131 aatgaagtgc aaatgccact gcctggactg aaggcctttc acgatctgtc ttctgctgga 1191 ctctgctcct gatccccctt ctccttgcat cacccgaagt ctccctacac ccaccaggcc 1251 aagccctctg tgattctgag actttgcatg tgtagttact tctcctgaaa tggccttcct 1311 ccccattcct gccaatccag gtccttatca tccttcaggt tgtcttaaat gtcatccagg 1371 tgtgtgtatt tttatgtaat ccttgtatga tattaagcgg agatgtggca tttgttcatt 1431 aatttgtaga catattcagt aaccatactg aatacatata atgactatgt gccagcattt 1491 ccgtatgtgc aagaagttca tcaatagata tagactcaaa gagctctgtc atcaagctgt 1551 tgttctgaag agcagaagga tacaaataaa aagaaataag taaaa 1596 2 323 PRT Homo sapiens 2 Met Leu Leu Pro Thr Ala Leu Phe Ser Ala His Thr Gly Ser Pro Gly 1 5 10 15 Ser Leu Cys Leu Arg Asp Ala Ala Ala Ala Ala Ala Ala Ala Pro Ala 20 25 30 Val Gly Arg Ala Leu Ala Gln Asp Ala Arg Phe Arg Leu Glu Met Pro 35 40 45 Glu Ser Val Thr Val Gln Glu Gly Leu Cys Ile Phe Val His Cys Ser 50 55 60 Val Phe Tyr Leu Glu Tyr Gly Trp Lys Asp Ser Thr Pro Ala Tyr Gly 65 70 75 80 His Trp Phe Arg Glu Gly Val Ser Val Asp Gln Glu Thr Pro Val Ala 85 90 95 Thr Asn Asn Ser Thr Gln Lys Val Gln Lys Glu Thr Gln Gly Arg Phe 100 105 110 His Leu Leu Gly Asp Pro Ser Arg Asn Asn Cys Ser Leu Ser Ile Arg 115 120 125 Asp Ala Arg Arg Arg Asp Asn Gly Ser Tyr Phe Phe Trp Val Ala Arg 130 135 140 Gly Arg Thr Lys Phe Ser Tyr Lys Tyr Ser Pro Leu Ser Val Tyr Val 145 150 155 160 Thr Ala Leu Thr His Arg Pro Asp Ile Leu Ile Pro Glu Phe Leu Lys 165 170 175 Ser Gly His Pro Ser Asn Leu Thr Cys Ser Val Pro Trp Val Cys Glu 180 185 190 Gln Gly Thr Pro Pro Ile Phe Ser Trp Met Ser Ala Ala Pro Thr Ser 195 200 205 Leu Gly Pro Arg Thr Leu His Ser Ser Val Leu Thr Ile Ile Pro Arg 210 215 220 Pro Gln Asp His Gly Thr Asn Leu Ile Cys Gln Val Thr Phe Pro Gly 225 230 235 240 Ala Gly Val Thr Thr Glu Arg Thr Ile Gln Leu Ser Val Ser Tyr Pro 245 250 255 Leu Met Asn Pro Val Ile Glu Ala Ser Leu Glu Asp Gly Thr Gly Lys 260 265 270 Ser Gly Thr Val Glu Glu Val Val Val Leu Ala Val Gly Val Val Ala 275 280 285 Val Lys Ile Leu Leu Leu Cys Leu Cys Leu Ile Ile Leu Ser Phe His 290 295 300 Lys Lys Lys Ala Val Arg Ala Val Glu Val Glu Glu Asn Val Tyr Ala 305 310 315 320 Val Met Gly 3 6912 DNA Homo sapiens CDS (1)..(3456) 3 atg agg aaa ttc aac atc agg aag gtg ctg gac ggc ctg acc gcc ggc 48 Met Arg Lys Phe Asn Ile Arg Lys Val Leu Asp Gly Leu Thr Ala Gly 1 5 10 15 tcg tcc tcg gcg tcg cag cag caa cag cag cag cat ccg cct ggg aac 96 Ser Ser Ser Ala Ser Gln Gln Gln Gln Gln Gln His Pro Pro Gly Asn 20 25 30 cgg gag ccg gag atc cag gaa acg ctc cag tcc gag cac ttt cag ctc 144 Arg Glu Pro Glu Ile Gln Glu Thr Leu Gln Ser Glu His Phe Gln Leu 35 40 45 tgc aag act gtt cgc cat gga ttt ccc tat caa ccc tca gcc ctg gcc 192 Cys Lys Thr Val Arg His Gly Phe Pro Tyr Gln Pro Ser Ala Leu Ala 50 55 60 ttt gat cct gta cag aag atc ctg gca gtg gga act cag act ggt gct 240 Phe Asp Pro Val Gln Lys Ile Leu Ala Val Gly Thr Gln Thr Gly Ala 65 70 75 80 tta agg ctc ttt ggt cgt cca gga gta gaa tgt tat tgc cag cat gac 288 Leu Arg Leu Phe Gly Arg Pro Gly Val Glu Cys Tyr Cys Gln His Asp 85 90 95 agt gga gct gca gta atc cag ctc cag ttc ctg att aat gag gga gcg 336 Ser Gly Ala Ala Val Ile Gln Leu Gln Phe Leu Ile Asn Glu Gly Ala 100 105 110 ctt gtg agt gcc ttg gct gat gac acc tta cac tta tgg aat tta cgt 384 Leu Val Ser Ala Leu Ala Asp Asp Thr Leu His Leu Trp Asn Leu Arg 115 120 125 cag aag agg cct gcc ata cta cat tcg ctt aaa ttt tgc aga gaa agg 432 Gln Lys Arg Pro Ala Ile Leu His Ser Leu Lys Phe Cys Arg Glu Arg 130 135 140 gtt aca ttt tgc cat ctg cct ttc cag agt aag tgg ctc tat gtg ggc 480 Val Thr Phe Cys His Leu Pro Phe Gln Ser Lys Trp Leu Tyr Val Gly 145 150 155 160 act gaa cga ggt aat ata cat att gtc aat gtg gag tcc ttc aca ctc 528 Thr Glu Arg Gly Asn Ile His Ile Val Asn Val Glu Ser Phe Thr Leu 165 170 175 tca ggc tac gtc att atg tgg aat aaa gcc att gaa ctg tca tct aaa 576 Ser Gly Tyr Val Ile Met Trp Asn Lys Ala Ile Glu Leu Ser Ser Lys 180 185 190 tct cac cca gga cct gtg gtc cat ata agt gat aat cca atg gac gag 624 Ser His Pro Gly Pro Val Val His Ile Ser Asp Asn Pro Met Asp Glu 195 200 205 gga aag ctt ttg att ggc ttt gaa tct gga aca gta gtt tta tgg gac 672 Gly Lys Leu Leu Ile Gly Phe Glu Ser Gly Thr Val Val Leu Trp Asp 210 215 220 ctc aaa tca aag aaa gcc gac tac aga tac aca tat gat gag gct atc 720 Leu Lys Ser Lys Lys Ala Asp Tyr Arg Tyr Thr Tyr Asp Glu Ala Ile 225 230 235 240 cac tct gtt gct tgg cat cat gaa gga aaa caa ttt att tgc agt cat 768 His Ser Val Ala Trp His His Glu Gly Lys Gln Phe Ile Cys Ser His 245 250 255 tca gat ggc acc ttg act ata tgg aat gta agg tcc cct gct aaa cca 816 Ser Asp Gly Thr Leu Thr Ile Trp Asn Val Arg Ser Pro Ala Lys Pro 260 265 270 gta cag aca atc act cca cat gga aaa cag tta aag gat ggg aag aag 864 Val Gln Thr Ile Thr Pro His Gly Lys Gln Leu Lys Asp Gly Lys Lys 275 280 285 cca gaa cca tgc aaa cct atc ctc aag gtg gaa ttc aaa acg act aga 912 Pro Glu Pro Cys Lys Pro Ile Leu Lys Val Glu Phe Lys Thr Thr Arg 290 295 300 tct ggg gag cct ttt att att tta tca gga ggt ttg tca tat gat act 960 Ser Gly Glu Pro Phe Ile Ile Leu Ser Gly Gly Leu Ser Tyr Asp Thr 305 310 315 320 gta gga aga aga cct tgc tta aca gtg atg cat ggg aaa agc act gct 1008 Val Gly Arg Arg Pro Cys Leu Thr Val Met His Gly Lys Ser Thr Ala 325 330 335 gtg cta gaa atg gac tat tca att gtt gat ttt cta acg ctg tgt gaa 1056 Val Leu Glu Met Asp Tyr Ser Ile Val Asp Phe Leu Thr Leu Cys Glu 340 345 350 aca cca tac cca aat gat ttt caa gaa cca tat gct gtg gtt gtt ctt 1104 Thr Pro Tyr Pro Asn Asp Phe Gln Glu Pro Tyr Ala Val Val Val Leu 355 360 365 cta gaa aag gat tta gta ctt ata gac ctt gca caa aat gga tat cct 1152 Leu Glu Lys Asp Leu Val Leu Ile Asp Leu Ala Gln Asn Gly Tyr Pro 370 375 380 ata ttt gaa aat ccc tac cct ttg agt ata cat gag tcc cct gtt aca 1200 Ile Phe Glu Asn Pro Tyr Pro Leu Ser Ile His Glu Ser Pro Val Thr 385 390 395 400 tgt tgc gaa tat ttt gcg gat tgt cct gtg gac ctt att cct gca ctt 1248 Cys Cys Glu Tyr Phe Ala Asp Cys Pro Val Asp Leu Ile Pro Ala Leu 405 410 415 tat tct gtt gga gct aga cag aaa cgt caa ggt tac agc aaa aag gaa 1296 Tyr Ser Val Gly Ala Arg Gln Lys Arg Gln Gly Tyr Ser Lys Lys Glu 420 425 430 tgg ccc atc aac gga ggt aat tgg ggc ttg ggt gct caa agt tac cca 1344 Trp Pro Ile Asn Gly Gly Asn Trp Gly Leu Gly Ala Gln Ser Tyr Pro 435 440 445 gaa ata att att aca ggg cat gct gat ggg tca gtt aag ttc tgg gat 1392 Glu Ile Ile Ile Thr Gly His Ala Asp Gly Ser Val Lys Phe Trp Asp 450 455 460 gct tct gca ata act cta caa gta tta tat aag cta aag aca tct aaa 1440 Ala Ser Ala Ile Thr Leu Gln Val Leu Tyr Lys Leu Lys Thr Ser Lys 465 470 475 480 gta ttt gaa aag tca aga aat aaa gat gac agg cca aac aca gac att 1488 Val Phe Glu Lys Ser Arg Asn Lys Asp Asp Arg Pro Asn Thr Asp Ile 485 490 495 gta gat gaa gat cca tat gcc att cag atc atc tcc tgg tgt cca gaa 1536 Val Asp Glu Asp Pro Tyr Ala Ile Gln Ile Ile Ser Trp Cys Pro Glu 500 505 510 agt aga atg ctg tgc atc gct gga gtt tca gct cat gtc att att tat 1584 Ser Arg Met Leu Cys Ile Ala Gly Val Ser Ala His Val Ile Ile Tyr 515 520 525 aga ttc agc aag cag gaa gta atc aca gaa gtc att ccg atg ctt gaa 1632 Arg Phe Ser Lys Gln Glu Val Ile Thr Glu Val Ile Pro Met Leu Glu 530 535 540 gtt cga tta tta tat gag ata aat gat gtg gaa act ccg gag ggt gag 1680 Val Arg Leu Leu Tyr Glu Ile Asn Asp Val Glu Thr Pro Glu Gly Glu 545 550 555 560 cag cca cca cct ttg cca aca ccc gtg gga ggg tcc aac cct cag ccc 1728 Gln Pro Pro Pro Leu Pro Thr Pro Val Gly Gly Ser Asn Pro Gln Pro 565 570 575 atc cct cct cag tct cat cca tct acc agt agc agt tca tct gat ggg 1776 Ile Pro Pro Gln Ser His Pro Ser Thr Ser Ser Ser Ser Ser Asp Gly 580 585 590 ctt cgt gat aat gta cct tgt tta aaa gtt aaa aac tca cca ctt aaa 1824 Leu Arg Asp Asn Val Pro Cys Leu Lys Val Lys Asn Ser Pro Leu Lys 595 600 605 cag tct cca ggt tat caa aca gaa cta gtt att cag ttg gtt tgg gtg 1872 Gln Ser Pro Gly Tyr Gln Thr Glu Leu Val Ile Gln Leu Val Trp Val 610 615 620 ggt gga gaa cca cca caa caa ata acc agc ctg gca gtc aat tct tcc 1920 Gly Gly Glu Pro Pro Gln Gln Ile Thr Ser Leu Ala Val Asn Ser Ser 625 630 635 640 tat gga ctg gtg gtt ttt ggc aat tgc aat ggc att gct atg gtt gac 1968 Tyr Gly Leu Val Val Phe Gly Asn Cys Asn Gly Ile Ala Met Val Asp 645 650 655 tac ctc cag aaa gca gtg ctg ctc aac ctg ggc act att gaa tta tat 2016 Tyr Leu Gln Lys Ala Val Leu Leu Asn Leu Gly Thr Ile Glu Leu Tyr 660 665 670 ggc tct aat gat cct tat cgg aga gaa ccc cga tct cct cgt aaa tct 2064 Gly Ser Asn Asp Pro Tyr Arg Arg Glu Pro Arg Ser Pro Arg Lys Ser 675 680 685 cga cag cct tca gga gcc ggt ctg tgt gat att agt gaa ggg act gtt 2112 Arg Gln Pro Ser Gly Ala Gly Leu Cys Asp Ile Ser Glu Gly Thr Val 690 695 700 gtt cca gag gat cgc tgc aaa tct cca acc tct ggt tct tca tca cca 2160 Val Pro Glu Asp Arg Cys Lys Ser Pro Thr Ser Gly Ser Ser Ser Pro 705 710 715 720 cac aat tca gat gat gaa caa aaa atg aat aat ttt ata gaa aag gtg 2208 His Asn Ser Asp Asp Glu Gln Lys Met Asn Asn Phe Ile Glu Lys Val 725 730 735 aag acc aaa agc aga aag ttt tcc aag atg gta gcc aat gat ata gca 2256 Lys Thr Lys Ser Arg Lys Phe Ser Lys Met Val Ala Asn Asp Ile Ala 740 745 750 aag atg tca agg aag tta agc tta cct act gac cta aag cct gat tta 2304 Lys Met Ser Arg Lys Leu Ser Leu Pro Thr Asp Leu Lys Pro Asp Leu 755 760 765 gat gta aag gat aac tcc ttt agc cga tca cgg agt tca agt gta aca 2352 Asp Val Lys Asp Asn Ser Phe Ser Arg Ser Arg Ser Ser Ser Val Thr 770 775 780 agc att gac aaa gaa tcc cga gaa gcg atc tcc gct ctt cat ttc tgt 2400 Ser Ile Asp Lys Glu Ser Arg Glu Ala Ile Ser Ala Leu His Phe Cys 785 790 795 800 gaa acg ttt act cga aag acg gac tcg tcc cct tcc cct tgt ctg tgg 2448 Glu Thr Phe Thr Arg Lys Thr Asp Ser Ser Pro Ser Pro Cys Leu Trp 805 810 815 gtt gga aca acg cta gga aca gtg ctt gtc att gca ctg aac ctt ccc 2496 Val Gly Thr Thr Leu Gly Thr Val Leu Val Ile Ala Leu Asn Leu Pro 820 825 830 cca ggg gga gag caa aga ctt ctt cag cca gta att gtg tct cca agt 2544 Pro Gly Gly Glu Gln Arg Leu Leu Gln Pro Val Ile Val Ser Pro Ser 835 840 845 ggt act ata ttg agg tta aaa ggt gca atc ttg aga atg gca ttt ctg 2592 Gly Thr Ile Leu Arg Leu Lys Gly Ala Ile Leu Arg Met Ala Phe Leu 850 855 860 gat acc aca ggc tgc tta ata cca cct gcg tat gaa ccc tgg aga gag 2640 Asp Thr Thr Gly Cys Leu Ile Pro Pro Ala Tyr Glu Pro Trp Arg Glu 865 870 875 880 cac aat gtt cct gaa gaa aaa gac gaa aag gag aaa ttg aaa aaa cgg 2688 His Asn Val Pro Glu Glu Lys Asp Glu Lys Glu Lys Leu Lys Lys Arg 885 890 895 cgg cct gtc tca gta tcc ccc tcc tct tct cag gaa att agt gaa aac 2736 Arg Pro Val Ser Val Ser Pro Ser Ser Ser Gln Glu Ile Ser Glu Asn 900 905 910 cag tat gca gtg ata tgt tct gaa aag caa gca aaa gta atc tca ctg 2784 Gln Tyr Ala Val Ile Cys Ser Glu Lys Gln Ala Lys Val Ile Ser Leu 915 920 925 cca acc cag aac tgt gct tat aag caa aat att aca gag acc tcg ttt 2832 Pro Thr Gln Asn Cys Ala Tyr Lys Gln Asn Ile Thr Glu Thr Ser Phe 930 935 940 gtg ctt cgt gga gat att gta gca ttg agt aac agt atc tgc ctt gcc 2880 Val Leu Arg Gly Asp Ile Val Ala Leu Ser Asn Ser Ile Cys Leu Ala 945 950 955 960 tgt ttc tgt gcc aat gga cat ata atg act ttt agt ttg cca agt tta 2928 Cys Phe Cys Ala Asn Gly His Ile Met Thr Phe Ser Leu Pro Ser Leu 965 970 975 aga cct ctg ttg gat gtg tat tac ttg ccc ctt acc aat atg cgg ata 2976 Arg Pro Leu Leu Asp Val Tyr Tyr Leu Pro Leu Thr Asn Met Arg Ile 980 985 990 gcc aga acg ttc tgc ttt acc aac aat gga caa gca tta tac ctt gtt 3024 Ala Arg Thr Phe Cys Phe Thr Asn Asn Gly Gln Ala Leu Tyr Leu Val 995 1000 1005 tca cct aca gaa atc cag aga ctt act tat agt caa gag acc tgt 3069 Ser Pro Thr Glu Ile Gln Arg Leu Thr Tyr Ser Gln Glu Thr Cys 1010 1015 1020 gaa aat ctt cag gaa atg ttg ggt gaa ctc ttc act cct gta gaa 3114 Glu Asn Leu Gln Glu Met Leu Gly Glu Leu Phe Thr Pro Val Glu 1025 1030 1035 aca cct gaa gca cca aac agg gga ttc ttt aaa ggc tta ttt gga 3159 Thr Pro Glu Ala Pro Asn Arg Gly Phe Phe Lys Gly Leu Phe Gly 1040 1045 1050 ggt ggt gca caa tct ctt gac aga gaa gaa cta ttt gga gaa tcg 3204 Gly Gly Ala Gln Ser Leu Asp Arg Glu Glu Leu Phe Gly Glu Ser 1055 1060 1065 tcc tca gga aag gct tca agg agc ctt gca cag cat att cct ggc 3249 Ser Ser Gly Lys Ala Ser Arg Ser Leu Ala Gln His Ile Pro Gly 1070 1075 1080 cct ggt ggc att gaa ggc gta aaa ggg gca gca tct gga gtt gtt 3294 Pro Gly Gly Ile Glu Gly Val Lys Gly Ala Ala Ser Gly Val Val 1085 1090 1095 ggt gaa tta gca cga gcc agg ctg gca cta gat gaa aga ggg cag 3339 Gly Glu Leu Ala Arg Ala Arg Leu Ala Leu Asp Glu Arg Gly Gln 1100 1105 1110 aaa ctt ggc gat ctg gaa gaa aga act gcg gcc atg tta tca agt 3384 Lys Leu Gly Asp Leu Glu Glu Arg Thr Ala Ala Met Leu Ser Ser 1115 1120 1125 gca gag tca ttt tct aaa cat gct cat gag att atg ttg aaa tac 3429 Ala Glu Ser Phe Ser Lys His Ala His Glu Ile Met Leu Lys Tyr 1130 1135 1140 aaa gat aag aag tgg tac cag ttc tga atgaggaaat tcaacatcag 3476 Lys Asp Lys Lys Trp Tyr Gln Phe 1145 1150 gaaggtgctg gacggcctga ccgccggctc gtcctcggcg tcgcagcagc aacagcagca 3536 gcatccgcct gggaaccggg agccggagat ccaggaaacg ctccagtccg agcactttca 3596 gctctgcaag actgttcgcc atggatttcc ctatcaaccc tcagccctgg cctttgatcc 3656 tgtacagaag atcctggcag tgggaactca gactggtgct ttaaggctct ttggtcgtcc 3716 aggagtagaa tgttattgcc agcatgacag tggagctgca gtaatccagc tccagttcct 3776 gattaatgag ggagcgcttg tgagtgcctt ggctgatgac accttacact tatggaattt 3836 acgtcagaag aggcctgcca tactacattc gcttaaattt tgcagagaaa gggttacatt 3896 ttgccatctg cctttccaga gtaagtggct ctatgtgggc actgaacgag gtaatataca 3956 tattgtcaat gtggagtcct tcacactctc aggctacgtc attatgtgga ataaagccat 4016 tgaactgtca tctaaatctc acccaggacc tgtggtccat ataagtgata atccaatgga 4076 cgagggaaag cttttgattg gctttgaatc tggaacagta gttttatggg acctcaaatc 4136 aaagaaagcc gactacagat acacatatga tgaggctatc cactctgttg cttggcatca 4196 tgaaggaaaa caatttattt gcagtcattc agatggcacc ttgactatat ggaatgtaag 4256 gtcccctgct aaaccagtac agacaatcac tccacatgga aaacagttaa aggatgggaa 4316 gaagccagaa ccatgcaaac ctatcctcaa ggtggaattc aaaacgacta gatctgggga 4376 gccttttatt attttatcag gaggtttgtc atatgatact gtaggaagaa gaccttgctt 4436 aacagtgatg catgggaaaa gcactgctgt gctagaaatg gactattcaa ttgttgattt 4496 tctaacgctg tgtgaaacac catacccaaa tgattttcaa gaaccatatg ctgtggttgt 4556 tcttctagaa aaggatttag tacttataga ccttgcacaa aatggatatc ctatatttga 4616 aaatccctac cctttgagta tacatgagtc ccctgttaca tgttgcgaat attttgcgga 4676 ttgtcctgtg gaccttattc ctgcacttta ttctgttgga gctagacaga aacgtcaagg 4736 ttacagcaaa aaggaatggc ccatcaacgg aggtaattgg ggcttgggtg ctcaaagtta 4796 cccagaaata attattacag ggcatgctga tgggtcagtt aagttctggg atgcttctgc 4856 aataactcta caagtattat ataagctaaa gacatctaaa gtatttgaaa agtcaagaaa 4916 taaagatgac aggccaaaca cagacattgt agatgaagat ccatatgcca ttcagatcat 4976 ctcctggtgt ccagaaagta gaatgctgtg catcgctgga gtttcagctc atgtcattat 5036 ttatagattc agcaagcagg aagtaatcac agaagtcatt ccgatgcttg aagttcgatt 5096 attatatgag ataaatgatg tggaaactcc ggagggtgag cagccaccac ctttgccaac 5156 acccgtggga gggtccaacc ctcagcccat ccctcctcag tctcatccat ctaccagtag 5216 cagttcatct gatgggcttc gtgataatgt accttgttta aaagttaaaa actcaccact 5276 taaacagtct ccaggttatc aaacagaact agttattcag ttggtttggg tgggtggaga 5336 accaccacaa caaataacca gcctggcagt caattcttcc tatggactgg tggtttttgg 5396 caattgcaat ggcattgcta tggttgacta cctccagaaa gcagtgctgc tcaacctggg 5456 cactattgaa ttatatggct ctaatgatcc ttatcggaga gaaccccgat ctcctcgtaa 5516 atctcgacag ccttcaggag ccggtctgtg tgatattagt gaagggactg ttgttccaga 5576 ggatcgctgc aaatctccaa cctctggttc ttcatcacca cacaattcag atgatgaaca 5636 aaaaatgaat aattttatag aaaaggtgaa gaccaaaagc agaaagtttt ccaagatggt 5696 agccaatgat atagcaaaga tgtcaaggaa gttaagctta cctactgacc taaagcctga 5756 tttagatgta aaggataact cctttagccg atcacggagt tcaagtgtaa caagcattga 5816 caaagaatcc cgagaagcga tctccgctct tcatttctgt gaaacgttta ctcgaaagac 5876 ggactcgtcc ccttcccctt gtctgtgggt tggaacaacg ctaggaacag tgcttgtcat 5936 tgcactgaac cttcccccag ggggagagca aagacttctt cagccagtaa ttgtgtctcc 5996 aagtggtact atattgaggt taaaaggtgc aatcttgaga atggcatttc tggataccac 6056 aggctgctta ataccacctg cgtatgaacc ctggagagag cacaatgttc ctgaagaaaa 6116 agacgaaaag gagaaattga aaaaacggcg gcctgtctca gtatccccct cctcttctca 6176 ggaaattagt gaaaaccagt atgcagtgat atgttctgaa aagcaagcaa aagtaatctc 6236 actgccaacc cagaactgtg cttataagca aaatattaca gagacctcgt ttgtgcttcg 6296 tggagatatt gtagcattga gtaacagtat ctgccttgcc tgtttctgtg ccaatggaca 6356 tataatgact tttagtttgc caagtttaag acctctgttg gatgtgtatt acttgcccct 6416 taccaatatg cggatagcca gaacgttctg ctttaccaac aatggacaag cattatacct 6476 tgtttcacct acagaaatcc agagacttac ttatagtcaa gagacctgtg aaaatcttca 6536 ggaaatgttg ggtgaactct tcactcctgt agaaacacct gaagcaccaa acaggggatt 6596 ctttaaaggc ttatttggag gtggtgcaca atctcttgac agagaagaac tatttggaga 6656 atcgtcctca ggaaaggctt caaggagcct tgcacagcat attcctggcc ctggtggcat 6716 tgaaggcgta aaaggggcag catctggagt tgttggtgaa ttagcacgag ccaggctggc 6776 actagatgaa agagggcaga aacttggcga tctggaagaa agaactgcgg ccatgttatc 6836 aagtgcagag tcattttcta aacatgctca tgagattatg ttgaaataca aagataagaa 6896 gtggtaccag ttctga 6912 4 1151 PRT Homo sapiens 4 Met Arg Lys Phe Asn Ile Arg Lys Val Leu Asp Gly Leu Thr Ala Gly 1 5 10 15 Ser Ser Ser Ala Ser Gln Gln Gln Gln Gln Gln His Pro Pro Gly Asn 20 25 30 Arg Glu Pro Glu Ile Gln Glu Thr Leu Gln Ser Glu His Phe Gln Leu 35 40 45 Cys Lys Thr Val Arg His Gly Phe Pro Tyr Gln Pro Ser Ala Leu Ala 50 55 60 Phe Asp Pro Val Gln Lys Ile Leu Ala Val Gly Thr Gln Thr Gly Ala 65 70 75 80 Leu Arg Leu Phe Gly Arg Pro Gly Val Glu Cys Tyr Cys Gln His Asp 85 90 95 Ser Gly Ala Ala Val Ile Gln Leu Gln Phe Leu Ile Asn Glu Gly Ala 100 105 110 Leu Val Ser Ala Leu Ala Asp Asp Thr Leu His Leu Trp Asn Leu Arg 115 120 125 Gln Lys Arg Pro Ala Ile Leu His Ser Leu Lys Phe Cys Arg Glu Arg 130 135 140 Val Thr Phe Cys His Leu Pro Phe Gln Ser Lys Trp Leu Tyr Val Gly 145 150 155 160 Thr Glu Arg Gly Asn Ile His Ile Val Asn Val Glu Ser Phe Thr Leu 165 170 175 Ser Gly Tyr Val Ile Met Trp Asn Lys Ala Ile Glu Leu Ser Ser Lys 180 185 190 Ser His Pro Gly Pro Val Val His Ile Ser Asp Asn Pro Met Asp Glu 195 200 205 Gly Lys Leu Leu Ile Gly Phe Glu Ser Gly Thr Val Val Leu Trp Asp 210 215 220 Leu Lys Ser Lys Lys Ala Asp Tyr Arg Tyr Thr Tyr Asp Glu Ala Ile 225 230 235 240 His Ser Val Ala Trp His His Glu Gly Lys Gln Phe Ile Cys Ser His 245 250 255 Ser Asp Gly Thr Leu Thr Ile Trp Asn Val Arg Ser Pro Ala Lys Pro 260 265 270 Val Gln Thr Ile Thr Pro His Gly Lys Gln Leu Lys Asp Gly Lys Lys 275 280 285 Pro Glu Pro Cys Lys Pro Ile Leu Lys Val Glu Phe Lys Thr Thr Arg 290 295 300 Ser Gly Glu Pro Phe Ile Ile Leu Ser Gly Gly Leu Ser Tyr Asp Thr 305 310 315 320 Val Gly Arg Arg Pro Cys Leu Thr Val Met His Gly Lys Ser Thr Ala 325 330 335 Val Leu Glu Met Asp Tyr Ser Ile Val Asp Phe Leu Thr Leu Cys Glu 340 345 350 Thr Pro Tyr Pro Asn Asp Phe Gln Glu Pro Tyr Ala Val Val Val Leu 355 360 365 Leu Glu Lys Asp Leu Val Leu Ile Asp Leu Ala Gln Asn Gly Tyr Pro 370 375 380 Ile Phe Glu Asn Pro Tyr Pro Leu Ser Ile His Glu Ser Pro Val Thr 385 390 395 400 Cys Cys Glu Tyr Phe Ala Asp Cys Pro Val Asp Leu Ile Pro Ala Leu 405 410 415 Tyr Ser Val Gly Ala Arg Gln Lys Arg Gln Gly Tyr Ser Lys Lys Glu 420 425 430 Trp Pro Ile Asn Gly Gly Asn Trp Gly Leu Gly Ala Gln Ser Tyr Pro 435 440 445 Glu Ile Ile Ile Thr Gly His Ala Asp Gly Ser Val Lys Phe Trp Asp 450 455 460 Ala Ser Ala Ile Thr Leu Gln Val Leu Tyr Lys Leu Lys Thr Ser Lys 465 470 475 480 Val Phe Glu Lys Ser Arg Asn Lys Asp Asp Arg Pro Asn Thr Asp Ile 485 490 495 Val Asp Glu Asp Pro Tyr Ala Ile Gln Ile Ile Ser Trp Cys Pro Glu 500 505 510 Ser Arg Met Leu Cys Ile Ala Gly Val Ser Ala His Val Ile Ile Tyr 515 520 525 Arg Phe Ser Lys Gln Glu Val Ile Thr Glu Val Ile Pro Met Leu Glu 530 535 540 Val Arg Leu Leu Tyr Glu Ile Asn Asp Val Glu Thr Pro Glu Gly Glu 545 550 555 560 Gln Pro Pro Pro Leu Pro Thr Pro Val Gly Gly Ser Asn Pro Gln Pro 565 570 575 Ile Pro Pro Gln Ser His Pro Ser Thr Ser Ser Ser Ser Ser Asp Gly 580 585 590 Leu Arg Asp Asn Val Pro Cys Leu Lys Val Lys Asn Ser Pro Leu Lys 595 600 605 Gln Ser Pro Gly Tyr Gln Thr Glu Leu Val Ile Gln Leu Val Trp Val 610 615 620 Gly Gly Glu Pro Pro Gln Gln Ile Thr Ser Leu Ala Val Asn Ser Ser 625 630 635 640 Tyr Gly Leu Val Val Phe Gly Asn Cys Asn Gly Ile Ala Met Val Asp 645 650 655 Tyr Leu Gln Lys Ala Val Leu Leu Asn Leu Gly Thr Ile Glu Leu Tyr 660 665 670 Gly Ser Asn Asp Pro Tyr Arg Arg Glu Pro Arg Ser Pro Arg Lys Ser 675 680 685 Arg Gln Pro Ser Gly Ala Gly Leu Cys Asp Ile Ser Glu Gly Thr Val 690 695 700 Val Pro Glu Asp Arg Cys Lys Ser Pro Thr Ser Gly Ser Ser Ser Pro 705 710 715 720 His Asn Ser Asp Asp Glu Gln Lys Met Asn Asn Phe Ile Glu Lys Val 725 730 735 Lys Thr Lys Ser Arg Lys Phe Ser Lys Met Val Ala Asn Asp Ile Ala 740 745 750 Lys Met Ser Arg Lys Leu Ser Leu Pro Thr Asp Leu Lys Pro Asp Leu 755 760 765 Asp Val Lys Asp Asn Ser Phe Ser Arg Ser Arg Ser Ser Ser Val Thr 770 775 780 Ser Ile Asp Lys Glu Ser Arg Glu Ala Ile Ser Ala Leu His Phe Cys 785 790 795 800 Glu Thr Phe Thr Arg Lys Thr Asp Ser Ser Pro Ser Pro Cys Leu Trp 805 810 815 Val Gly Thr Thr Leu Gly Thr Val Leu Val Ile Ala Leu Asn Leu Pro 820 825 830 Pro Gly Gly Glu Gln Arg Leu Leu Gln Pro Val Ile Val Ser Pro Ser 835 840 845 Gly Thr Ile Leu Arg Leu Lys Gly Ala Ile Leu Arg Met Ala Phe Leu 850 855 860 Asp Thr Thr Gly Cys Leu Ile Pro Pro Ala Tyr Glu Pro Trp Arg Glu 865 870 875 880 His Asn Val Pro Glu Glu Lys Asp Glu Lys Glu Lys Leu Lys Lys Arg 885 890 895 Arg Pro Val Ser Val Ser Pro Ser Ser Ser Gln Glu Ile Ser Glu Asn 900 905 910 Gln Tyr Ala Val Ile Cys Ser Glu Lys Gln Ala Lys Val Ile Ser Leu 915 920 925 Pro Thr Gln Asn Cys Ala Tyr Lys Gln Asn Ile Thr Glu Thr Ser Phe 930 935 940 Val Leu Arg Gly Asp Ile Val Ala Leu Ser Asn Ser Ile Cys Leu Ala 945 950 955 960 Cys Phe Cys Ala Asn Gly His Ile Met Thr Phe Ser Leu Pro Ser Leu 965 970 975 Arg Pro Leu Leu Asp Val Tyr Tyr Leu Pro Leu Thr Asn Met Arg Ile 980 985 990 Ala Arg Thr Phe Cys Phe Thr Asn Asn Gly Gln Ala Leu Tyr Leu Val 995 1000 1005 Ser Pro Thr Glu Ile Gln Arg Leu Thr Tyr Ser Gln Glu Thr Cys 1010 1015 1020 Glu Asn Leu Gln Glu Met Leu Gly Glu Leu Phe Thr Pro Val Glu 1025 1030 1035 Thr Pro Glu Ala Pro Asn Arg Gly Phe Phe Lys Gly Leu Phe Gly 1040 1045 1050 Gly Gly Ala Gln Ser Leu Asp Arg Glu Glu Leu Phe Gly Glu Ser 1055 1060 1065 Ser Ser Gly Lys Ala Ser Arg Ser Leu Ala Gln His Ile Pro Gly 1070 1075 1080 Pro Gly Gly Ile Glu Gly Val Lys Gly Ala Ala Ser Gly Val Val 1085 1090 1095 Gly Glu Leu Ala Arg Ala Arg Leu Ala Leu Asp Glu Arg Gly Gln 1100 1105 1110 Lys Leu Gly Asp Leu Glu Glu Arg Thr Ala Ala Met Leu Ser Ser 1115 1120 1125 Ala Glu Ser Phe Ser Lys His Ala His Glu Ile Met Leu Lys Tyr 1130 1135 1140 Lys Asp Lys Lys Trp Tyr Gln Phe 1145 1150 5 3348 DNA Homo sapiens CDS (1)..(3348) 5 atg agg aaa ttc aac atc agg aag gtg ctg gac ggc ctg acc gcc ggc 48 Met Arg Lys Phe Asn Ile Arg Lys Val Leu Asp Gly Leu Thr Ala Gly 1 5 10 15 tcg tcc tcg gcg tcg cag cag caa cag cag cag cat ccg cct ggg aac 96 Ser Ser Ser Ala Ser Gln Gln Gln Gln Gln Gln His Pro Pro Gly Asn 20 25 30 cgg gag ccg gag atc cag gaa acg ctc cag tcc gag cac ttt cag ctc 144 Arg Glu Pro Glu Ile Gln Glu Thr Leu Gln Ser Glu His Phe Gln Leu 35 40 45 tgc aag act gtt cgc cat gga ttt ccc tat caa ccc tca gcc ctg gcc 192 Cys Lys Thr Val Arg His Gly Phe Pro Tyr Gln Pro Ser Ala Leu Ala 50 55 60 ttt gat cct gta cag aag atc ctg gca gtg gga act cag act ggt gct 240 Phe Asp Pro Val Gln Lys Ile Leu Ala Val Gly Thr Gln Thr Gly Ala 65 70 75 80 tta agg ctc ttt ggt cgt cca gga gta gaa tgt tat tgc cag cat gac 288 Leu Arg Leu Phe Gly Arg Pro Gly Val Glu Cys Tyr Cys Gln His Asp 85 90 95 agt gga gct gca gta atc cag ctc cag ttc ctg att aat gag gga gcg 336 Ser Gly Ala Ala Val Ile Gln Leu Gln Phe Leu Ile Asn Glu Gly Ala 100 105 110 ctt gtg agt gcc ttg gct gat gac acc tta cac tta tgg aat tta cgt 384 Leu Val Ser Ala Leu Ala Asp Asp Thr Leu His Leu Trp Asn Leu Arg 115 120 125 cag aag agg cct gcc ata cta cat tcg ctt aaa ttt tgc aga gaa agg 432 Gln Lys Arg Pro Ala Ile Leu His Ser Leu Lys Phe Cys Arg Glu Arg 130 135 140 gtt aca ttt tgc cat ctg cct ttc cag agt aag tgg ctc tat gtg ggc 480 Val Thr Phe Cys His Leu Pro Phe Gln Ser Lys Trp Leu Tyr Val Gly 145 150 155 160 act gaa cga ggt aat ata cat att gtc aat gtg gag tcc ttc aca ctc 528 Thr Glu Arg Gly Asn Ile His Ile Val Asn Val Glu Ser Phe Thr Leu 165 170 175 tca ggc tac gtc att atg tgg aat aaa gcc att gaa ctg tca tct aaa 576 Ser Gly Tyr Val Ile Met Trp Asn Lys Ala Ile Glu Leu Ser Ser Lys 180 185 190 tct cac cca gga cct gtg gtc cat ata agt gat aat cca atg gac gag 624 Ser His Pro Gly Pro Val Val His Ile Ser Asp Asn Pro Met Asp Glu 195 200 205 gga aag ctt ttg att ggc ttt gaa tct gga aca gta gtt tta tgg gac 672 Gly Lys Leu Leu Ile Gly Phe Glu Ser Gly Thr Val Val Leu Trp Asp 210 215 220 ctc aaa tca aag aaa gcc gac tac aga tac aca tat gat gag gct atc 720 Leu Lys Ser Lys Lys Ala Asp Tyr Arg Tyr Thr Tyr Asp Glu Ala Ile 225 230 235 240 cac tct gtt gct tgg cat cat gaa gga aaa caa ttt att tgc agt cat 768 His Ser Val Ala Trp His His Glu Gly Lys Gln Phe Ile Cys Ser His 245 250 255 tca gat ggc acc ttg act ata tgg aat gta agg tcc cct gct aaa cca 816 Ser Asp Gly Thr Leu Thr Ile Trp Asn Val Arg Ser Pro Ala Lys Pro 260 265 270 gta cag aca atc act cca cat gga aaa cag tta aag gat ggg aag aag 864 Val Gln Thr Ile Thr Pro His Gly Lys Gln Leu Lys Asp Gly Lys Lys 275 280 285 cca gaa cca tgc aaa cct atc ctc aag gtg gaa ttc aaa acg act aga 912 Pro Glu Pro Cys Lys Pro Ile Leu Lys Val Glu Phe Lys Thr Thr Arg 290 295 300 tct ggg gag cct ttt att att tta tca gga ggt ttg tca tat gat act 960 Ser Gly Glu Pro Phe Ile Ile Leu Ser Gly Gly Leu Ser Tyr Asp Thr 305 310 315 320 gta gga aga aga cct tgc tta aca gtg atg cat ggg aaa agc act gct 1008 Val Gly Arg Arg Pro Cys Leu Thr Val Met His Gly Lys Ser Thr Ala 325 330 335 gtg cta gaa atg gac tat tca att gtt gat ttt cta acg ctg tgt gaa 1056 Val Leu Glu Met Asp Tyr Ser Ile Val Asp Phe Leu Thr Leu Cys Glu 340 345 350 aca cca tac cca aat gat ttt caa gaa cca tat gct gtg gtt gtt ctt 1104 Thr Pro Tyr Pro Asn Asp Phe Gln Glu Pro Tyr Ala Val Val Val Leu 355 360 365 cta gaa aag gat tta gta ctt ata gac ctt gca caa aat gga tat cct 1152 Leu Glu Lys Asp Leu Val Leu Ile Asp Leu Ala Gln Asn Gly Tyr Pro 370 375 380 ata ttt gaa aat ccc tac cct ttg agt ata cat gag tcc cct gtt aca 1200 Ile Phe Glu Asn Pro Tyr Pro Leu Ser Ile His Glu Ser Pro Val Thr 385 390 395 400 tgt tgc gaa tat ttt gcg gat tgt cct gtg gac ctt att cct gca ctt 1248 Cys Cys Glu Tyr Phe Ala Asp Cys Pro Val Asp Leu Ile Pro Ala Leu 405 410 415 tat tct gtt gga gct aga cag aaa cgt caa ggt tac agc aaa aag gaa 1296 Tyr Ser Val Gly Ala Arg Gln Lys Arg Gln Gly Tyr Ser Lys Lys Glu 420 425 430 tgg ccc atc aac gga ggt aat tgg ggc ttg ggt gct caa agt tac cca 1344 Trp Pro Ile Asn Gly Gly Asn Trp Gly Leu Gly Ala Gln Ser Tyr Pro 435 440 445 gaa ata att att aca ggg cat gct gat ggg tca gtt aag ttc tgg gat 1392 Glu Ile Ile Ile Thr Gly His Ala Asp Gly Ser Val Lys Phe Trp Asp 450 455 460 gct tct gca ata act cta caa gta tta tat aag cta aag aca tct aaa 1440 Ala Ser Ala Ile Thr Leu Gln Val Leu Tyr Lys Leu Lys Thr Ser Lys 465 470 475 480 gta ttt gaa aag tca aga aat aaa gat gac agg cca aac aca gac att 1488 Val Phe Glu Lys Ser Arg Asn Lys Asp Asp Arg Pro Asn Thr Asp Ile 485 490 495 gta gat gaa gat cca tat gcc att cag atc atc tcc tgg tgt cca gaa 1536 Val Asp Glu Asp Pro Tyr Ala Ile Gln Ile Ile Ser Trp Cys Pro Glu 500 505 510 agt aga atg ctg tgc atc gct gga gtt tca gct cat gtc att att tat 1584 Ser Arg Met Leu Cys Ile Ala Gly Val Ser Ala His Val Ile Ile Tyr 515 520 525 aga ttc agc aag cag gaa gta atc aca gaa gtc att ccg atg ctt gaa 1632 Arg Phe Ser Lys Gln Glu Val Ile Thr Glu Val Ile Pro Met Leu Glu 530 535 540 gtt cga tta tta tat gag ata aat gat gtg gaa act ccg gag ggt gag 1680 Val Arg Leu Leu Tyr Glu Ile Asn Asp Val Glu Thr Pro Glu Gly Glu 545 550 555 560 cag cca cca cct ttg cca aca ccc gtg gga ggg tcc aac cct cag ccc 1728 Gln Pro Pro Pro Leu Pro Thr Pro Val Gly Gly Ser Asn Pro Gln Pro 565 570 575 atc cct cct cag tct cat cca tct acc agt agc agt tca tct gat ggg 1776 Ile Pro Pro Gln Ser His Pro Ser Thr Ser Ser Ser Ser Ser Asp Gly 580 585 590 ctt cgt gat aat gta cct tgt tta aaa gtt aaa aac tca cca ctt aaa 1824 Leu Arg Asp Asn Val Pro Cys Leu Lys Val Lys Asn Ser Pro Leu Lys 595 600 605 cag tct cca ggt tat caa aca gaa cta gtt att cag ttg gtt tgg gtg 1872 Gln Ser Pro Gly Tyr Gln Thr Glu Leu Val Ile Gln Leu Val Trp Val 610 615 620 ggt gga gaa cca cca caa caa ata acc agc ctg gca gtc aat tct tcc 1920 Gly Gly Glu Pro Pro Gln Gln Ile Thr Ser Leu Ala Val Asn Ser Ser 625 630 635 640 tat gga ctg gtg gtt ttt ggc aat tgc aat ggc att gct atg gtt gac 1968 Tyr Gly Leu Val Val Phe Gly Asn Cys Asn Gly Ile Ala Met Val Asp 645 650 655 tac ctc cag aaa gca gtg ctg ctc aac ctg ggc act att gaa tta tat 2016 Tyr Leu Gln Lys Ala Val Leu Leu Asn Leu Gly Thr Ile Glu Leu Tyr 660 665 670 ggc tct aat gat cct tat cgg aga gaa ccc cga tct cct cgt aaa tct 2064 Gly Ser Asn Asp Pro Tyr Arg Arg Glu Pro Arg Ser Pro Arg Lys Ser 675 680 685 cga cag cct tca gga gcc ggt ctg tgt gat att agt gaa ggg act gtt 2112 Arg Gln Pro Ser Gly Ala Gly Leu Cys Asp Ile Ser Glu Gly Thr Val 690 695 700 gtt cca gag gat cgc tgc aaa tct cca acc tct gca aag atg tca agg 2160 Val Pro Glu Asp Arg Cys Lys Ser Pro Thr Ser Ala Lys Met Ser Arg 705 710 715 720 aag tta agc tta cct act gac cta aag cct gat tta gat gta aag gat 2208 Lys Leu Ser Leu Pro Thr Asp Leu Lys Pro Asp Leu Asp Val Lys Asp 725 730 735 aac tcc ttt agc cga tca cgg agt tca agt gta aca agc att gac aaa 2256 Asn Ser Phe Ser Arg Ser Arg Ser Ser Ser Val Thr Ser Ile Asp Lys 740 745 750 gaa tcc cga gaa gcg atc tcc gct ctt cat ttc tgt gaa acg ttt act 2304 Glu Ser Arg Glu Ala Ile Ser Ala Leu His Phe Cys Glu Thr Phe Thr 755 760 765 cga aag acg gac tcg tcc cct tcc cct tgt ctg tgg gtt gga aca acg 2352 Arg Lys Thr Asp Ser Ser Pro Ser Pro Cys Leu Trp Val Gly Thr Thr 770 775 780 cta gga aca gtg ctt gtc att gca ctg aac ctt ccc cca ggg gga gag 2400 Leu Gly Thr Val Leu Val Ile Ala Leu Asn Leu Pro Pro Gly Gly Glu 785 790 795 800 caa aga ctt ctt cag cca gta att gtg tct cca agt ggt act ata ttg 2448 Gln Arg Leu Leu Gln Pro Val Ile Val Ser Pro Ser Gly Thr Ile Leu 805 810 815 agg tta aaa ggt gca atc ttg aga atg gca ttt ctg gat acc aca ggc 2496 Arg Leu Lys Gly Ala Ile Leu Arg Met Ala Phe Leu Asp Thr Thr Gly 820 825 830 tgc tta ata cca cct gcg tat gaa ccc tgg aga gag cac aat gtt cct 2544 Cys Leu Ile Pro Pro Ala Tyr Glu Pro Trp Arg Glu His Asn Val Pro 835 840 845 gaa gaa aaa gac gaa aag gag aaa ttg aaa aaa cgg cgg cct gtc tca 2592 Glu Glu Lys Asp Glu Lys Glu Lys Leu Lys Lys Arg Arg Pro Val Ser 850 855 860 gta tcc ccc tcc tct tct cag gaa att agt gaa aac cag tat gca gtg 2640 Val Ser Pro Ser Ser Ser Gln Glu Ile Ser Glu Asn Gln Tyr Ala Val 865 870 875 880 ata tgt tct gaa aag caa gca aaa gta atc tca ctg cca acc cag aac 2688 Ile Cys Ser Glu Lys Gln Ala Lys Val Ile Ser Leu Pro Thr Gln Asn 885 890 895 tgt gct tat aag caa aat att aca gag acc tcg ttt gtg ctt cgt gga 2736 Cys Ala Tyr Lys Gln Asn Ile Thr Glu Thr Ser Phe Val Leu Arg Gly 900 905 910 gat att gta gca ttg agt aac agt atc tgc ctt gcc tgt ttc tgt gcc 2784 Asp Ile Val Ala Leu Ser Asn Ser Ile Cys Leu Ala Cys Phe Cys Ala 915 920 925 aat gga cat ata atg act ttt agt ttg cca agt tta aga cct ctg ttg 2832 Asn Gly His Ile Met Thr Phe Ser Leu Pro Ser Leu Arg Pro Leu Leu 930 935 940 gat gtg tat tac ttg ccc ctt acc aat atg cgg ata gcc aga acg ttc 2880 Asp Val Tyr Tyr Leu Pro Leu Thr Asn Met Arg Ile Ala Arg Thr Phe 945 950 955 960 tgc ttt acc aac aat gga caa gca tta tac ctt gtt tca cct aca gaa 2928 Cys Phe Thr Asn Asn Gly Gln Ala Leu Tyr Leu Val Ser Pro Thr Glu 965 970 975 atc cag aga ctt act tat agt caa gag acc tgt gaa aat ctt cag gaa 2976 Ile Gln Arg Leu Thr Tyr Ser Gln Glu Thr Cys Glu Asn Leu Gln Glu 980 985 990 atg ttg ggt gaa ctc ttc act cct gta gaa aca cct gaa gca cca aac 3024 Met Leu Gly Glu Leu Phe Thr Pro Val Glu Thr Pro Glu Ala Pro Asn 995 1000 1005 agg gga ttc ttt aaa ggc tta ttt gga ggt ggt gca caa tct ctt 3069 Arg Gly Phe Phe Lys Gly Leu Phe Gly Gly Gly Ala Gln Ser Leu 1010 1015 1020 gac aga gaa gaa cta ttt gga gaa tcg tcc tca gga aag gct tca 3114 Asp Arg Glu Glu Leu Phe Gly Glu Ser Ser Ser Gly Lys Ala Ser 1025 1030 1035 agg agc ctt gca cag cat att cct ggc cct ggt ggc att gaa ggc 3159 Arg Ser Leu Ala Gln His Ile Pro Gly Pro Gly Gly Ile Glu Gly 1040 1045 1050 gta aaa ggg gca gca tct gga gtt gtt ggt gaa tta gca cga gcc 3204 Val Lys Gly Ala Ala Ser Gly Val Val Gly Glu Leu Ala Arg Ala 1055 1060 1065 agg ctg gca cta gat gaa aga ggg cag aaa ctt ggc gat ctg gaa 3249 Arg Leu Ala Leu Asp Glu Arg Gly Gln Lys Leu Gly Asp Leu Glu 1070 1075 1080 gaa aga act gcg gcc atg tta tca agt gca gag tca ttt tct aaa 3294 Glu Arg Thr Ala Ala Met Leu Ser Ser Ala Glu Ser Phe Ser Lys 1085 1090 1095 cat gct cat gag att atg ttg aaa tac aaa gat aag aag tgg tac 3339 His Ala His Glu Ile Met Leu Lys Tyr Lys Asp Lys Lys Trp Tyr 1100 1105 1110 cag ttc tga 3348 Gln Phe 1115 6 1115 PRT Homo sapiens 6 Met Arg Lys Phe Asn Ile Arg Lys Val Leu Asp Gly Leu Thr Ala Gly 1 5 10 15 Ser Ser Ser Ala Ser Gln Gln Gln Gln Gln Gln His Pro Pro Gly Asn 20 25 30 Arg Glu Pro Glu Ile Gln Glu Thr Leu Gln Ser Glu His Phe Gln Leu 35 40 45 Cys Lys Thr Val Arg His Gly Phe Pro Tyr Gln Pro Ser Ala Leu Ala 50 55 60 Phe Asp Pro Val Gln Lys Ile Leu Ala Val Gly Thr Gln Thr Gly Ala 65 70 75 80 Leu Arg Leu Phe Gly Arg Pro Gly Val Glu Cys Tyr Cys Gln His Asp 85 90 95 Ser Gly Ala Ala Val Ile Gln Leu Gln Phe Leu Ile Asn Glu Gly Ala 100 105 110 Leu Val Ser Ala Leu Ala Asp Asp Thr Leu His Leu Trp Asn Leu Arg 115 120 125 Gln Lys Arg Pro Ala Ile Leu His Ser Leu Lys Phe Cys Arg Glu Arg 130 135 140 Val Thr Phe Cys His Leu Pro Phe Gln Ser Lys Trp Leu Tyr Val Gly 145 150 155 160 Thr Glu Arg Gly Asn Ile His Ile Val Asn Val Glu Ser Phe Thr Leu 165 170 175 Ser Gly Tyr Val Ile Met Trp Asn Lys Ala Ile Glu Leu Ser Ser Lys 180 185 190 Ser His Pro Gly Pro Val Val His Ile Ser Asp Asn Pro Met Asp Glu 195 200 205 Gly Lys Leu Leu Ile Gly Phe Glu Ser Gly Thr Val Val Leu Trp Asp 210 215 220 Leu Lys Ser Lys Lys Ala Asp Tyr Arg Tyr Thr Tyr Asp Glu Ala Ile 225 230 235 240 His Ser Val Ala Trp His His Glu Gly Lys Gln Phe Ile Cys Ser His 245 250 255 Ser Asp Gly Thr Leu Thr Ile Trp Asn Val Arg Ser Pro Ala Lys Pro 260 265 270 Val Gln Thr Ile Thr Pro His Gly Lys Gln Leu Lys Asp Gly Lys Lys 275 280 285 Pro Glu Pro Cys Lys Pro Ile Leu Lys Val Glu Phe Lys Thr Thr Arg 290 295 300 Ser Gly Glu Pro Phe Ile Ile Leu Ser Gly Gly Leu Ser Tyr Asp Thr 305 310 315 320 Val Gly Arg Arg Pro Cys Leu Thr Val Met His Gly Lys Ser Thr Ala 325 330 335 Val Leu Glu Met Asp Tyr Ser Ile Val Asp Phe Leu Thr Leu Cys Glu 340 345 350 Thr Pro Tyr Pro Asn Asp Phe Gln Glu Pro Tyr Ala Val Val Val Leu 355 360 365 Leu Glu Lys Asp Leu Val Leu Ile Asp Leu Ala Gln Asn Gly Tyr Pro 370 375 380 Ile Phe Glu Asn Pro Tyr Pro Leu Ser Ile His Glu Ser Pro Val Thr 385 390 395 400 Cys Cys Glu Tyr Phe Ala Asp Cys Pro Val Asp Leu Ile Pro Ala Leu 405 410 415 Tyr Ser Val Gly Ala Arg Gln Lys Arg Gln Gly Tyr Ser Lys Lys Glu 420 425 430 Trp Pro Ile Asn Gly Gly Asn Trp Gly Leu Gly Ala Gln Ser Tyr Pro 435 440 445 Glu Ile Ile Ile Thr Gly His Ala Asp Gly Ser Val Lys Phe Trp Asp 450 455 460 Ala Ser Ala Ile Thr Leu Gln Val Leu Tyr Lys Leu Lys Thr Ser Lys 465 470 475 480 Val Phe Glu Lys Ser Arg Asn Lys Asp Asp Arg Pro Asn Thr Asp Ile 485 490 495 Val Asp Glu Asp Pro Tyr Ala Ile Gln Ile Ile Ser Trp Cys Pro Glu 500 505 510 Ser Arg Met Leu Cys Ile Ala Gly Val Ser Ala His Val Ile Ile Tyr 515 520 525 Arg Phe Ser Lys Gln Glu Val Ile Thr Glu Val Ile Pro Met Leu Glu 530 535 540 Val Arg Leu Leu Tyr Glu Ile Asn Asp Val Glu Thr Pro Glu Gly Glu 545 550 555 560 Gln Pro Pro Pro Leu Pro Thr Pro Val Gly Gly Ser Asn Pro Gln Pro 565 570 575 Ile Pro Pro Gln Ser His Pro Ser Thr Ser Ser Ser Ser Ser Asp Gly 580 585 590 Leu Arg Asp Asn Val Pro Cys Leu Lys Val Lys Asn Ser Pro Leu Lys 595 600 605 Gln Ser Pro Gly Tyr Gln Thr Glu Leu Val Ile Gln Leu Val Trp Val 610 615 620 Gly Gly Glu Pro Pro Gln Gln Ile Thr Ser Leu Ala Val Asn Ser Ser 625 630 635 640 Tyr Gly Leu Val Val Phe Gly Asn Cys Asn Gly Ile Ala Met Val Asp 645 650 655 Tyr Leu Gln Lys Ala Val Leu Leu Asn Leu Gly Thr Ile Glu Leu Tyr 660 665 670 Gly Ser Asn Asp Pro Tyr Arg Arg Glu Pro Arg Ser Pro Arg Lys Ser 675 680 685 Arg Gln Pro Ser Gly Ala Gly Leu Cys Asp Ile Ser Glu Gly Thr Val 690 695 700 Val Pro Glu Asp Arg Cys Lys Ser Pro Thr Ser Ala Lys Met Ser Arg 705 710 715 720 Lys Leu Ser Leu Pro Thr Asp Leu Lys Pro Asp Leu Asp Val Lys Asp 725 730 735 Asn Ser Phe Ser Arg Ser Arg Ser Ser Ser Val Thr Ser Ile Asp Lys 740 745 750 Glu Ser Arg Glu Ala Ile Ser Ala Leu His Phe Cys Glu Thr Phe Thr 755 760 765 Arg Lys Thr Asp Ser Ser Pro Ser Pro Cys Leu Trp Val Gly Thr Thr 770 775 780 Leu Gly Thr Val Leu Val Ile Ala Leu Asn Leu Pro Pro Gly Gly Glu 785 790 795 800 Gln Arg Leu Leu Gln Pro Val Ile Val Ser Pro Ser Gly Thr Ile Leu 805 810 815 Arg Leu Lys Gly Ala Ile Leu Arg Met Ala Phe Leu Asp Thr Thr Gly 820 825 830 Cys Leu Ile Pro Pro Ala Tyr Glu Pro Trp Arg Glu His Asn Val Pro 835 840 845 Glu Glu Lys Asp Glu Lys Glu Lys Leu Lys Lys Arg Arg Pro Val Ser 850 855 860 Val Ser Pro Ser Ser Ser Gln Glu Ile Ser Glu Asn Gln Tyr Ala Val 865 870 875 880 Ile Cys Ser Glu Lys Gln Ala Lys Val Ile Ser Leu Pro Thr Gln Asn 885 890 895 Cys Ala Tyr Lys Gln Asn Ile Thr Glu Thr Ser Phe Val Leu Arg Gly 900 905 910 Asp Ile Val Ala Leu Ser Asn Ser Ile Cys Leu Ala Cys Phe Cys Ala 915 920 925 Asn Gly His Ile Met Thr Phe Ser Leu Pro Ser Leu Arg Pro Leu Leu 930 935 940 Asp Val Tyr Tyr Leu Pro Leu Thr Asn Met Arg Ile Ala Arg Thr Phe 945 950 955 960 Cys Phe Thr Asn Asn Gly Gln Ala Leu Tyr Leu Val Ser Pro Thr Glu 965 970 975 Ile Gln Arg Leu Thr Tyr Ser Gln Glu Thr Cys Glu Asn Leu Gln Glu 980 985 990 Met Leu Gly Glu Leu Phe Thr Pro Val Glu Thr Pro Glu Ala Pro Asn 995 1000 1005 Arg Gly Phe Phe Lys Gly Leu Phe Gly Gly Gly Ala Gln Ser Leu 1010 1015 1020 Asp Arg Glu Glu Leu Phe Gly Glu Ser Ser Ser Gly Lys Ala Ser 1025 1030 1035 Arg Ser Leu Ala Gln His Ile Pro Gly Pro Gly Gly Ile Glu Gly 1040 1045 1050 Val Lys Gly Ala Ala Ser Gly Val Val Gly Glu Leu Ala Arg Ala 1055 1060 1065 Arg Leu Ala Leu Asp Glu Arg Gly Gln Lys Leu Gly Asp Leu Glu 1070 1075 1080 Glu Arg Thr Ala Ala Met Leu Ser Ser Ala Glu Ser Phe Ser Lys 1085 1090 1095 His Ala His Glu Ile Met Leu Lys Tyr Lys Asp Lys Lys Trp Tyr 1100 1105 1110 Gln Phe 1115 7 5184 DNA Homo sapiens CDS (317)..(4096) 7 gacagcacag agctcaggaa cgctgcctga ggaccctggg gcctacgagg aggagaagag 60 ggcaggagcg cttgcagaga ccctgggctc ctatcctgcc ataagcctcg ctgtctcctg 120 atatctgcag ccaggcccta ctgacacccc caggcctgag tgcaagcaga gaccccacca 180 ttcccaggcc ctggaggact ggtccacctt aactgggcag cccttggggc aggcgctggc 240 cggtgcctca gcccaggcct ctgtgctctg catgcactgc cagcctgcca tcaggcctct 300 attgcagccc tgaacc atg atc cag ggc acc ttg gag cca gat ggt ccc ctc 352 Met Ile Gln Gly Thr Leu Glu Pro Asp Gly Pro Leu 1 5 10 tgg ggc tgg gac tgg gac agt gac aat gac tgg gat agt gct gtg ctg 400 Trp Gly Trp Asp Trp Asp Ser Asp Asn Asp Trp Asp Ser Ala Val Leu 15 20 25 gcc ctc ctg gcg ctg gct gtg gtg gct gcc aca gcg ctg gcc tta cac 448 Ala Leu Leu Ala Leu Ala Val Val Ala Ala Thr Ala Leu Ala Leu His 30 35 40 tgg ttt ggc tcc ggg cac gat caa gag gcg gca gaa ccg gtg tcc aca 496 Trp Phe Gly Ser Gly His Asp Gln Glu Ala Ala Glu Pro Val Ser Thr 45 50 55 60 gcc ctc ggg gct caa cct cat cag gca gga gga gct gag ctg gcc ctg 544 Ala Leu Gly Ala Gln Pro His Gln Ala Gly Gly Ala Glu Leu Ala Leu 65 70 75 caa ccg aag tct aag gtc agt gat ggc agc gag ggg cag agc cca ggg 592 Gln Pro Lys Ser Lys Val Ser Asp Gly Ser Glu Gly Gln Ser Pro Gly 80 85 90 cag ggg aaa cca gag ccc cca gga cgc ggc cag cag agc cct gtc cct 640 Gln Gly Lys Pro Glu Pro Pro Gly Arg Gly Gln Gln Ser Pro Val Pro 95 100 105 gct gca gcg ccg ggc ggg ggc ctg gcc gcc atg gcc cgg ctt cca ctc 688 Ala Ala Ala Pro Gly Gly Gly Leu Ala Ala Met Ala Arg Leu Pro Leu 110 115 120 aag acg gct gtc gag gag gcc cgc aga gag gca tta gga cag caa cgg 736 Lys Thr Ala Val Glu Glu Ala Arg Arg Glu Ala Leu Gly Gln Gln Arg 125 130 135 140 ggc agt gcc acc ccc gcg gcc ccc cga gcg gaa gga aag gag cct ccc 784 Gly Ser Ala Thr Pro Ala Ala Pro Arg Ala Glu Gly Lys Glu Pro Pro 145 150 155 agg cca ggc act gcc ctc ctg ggc agg agc gaa gca ggg ggg atg tcc 832 Arg Pro Gly Thr Ala Leu Leu Gly Arg Ser Glu Ala Gly Gly Met Ser 160 165 170 gcc ccc ctc ctg atc cac ttc act cct cgg agc cct ggc agc gaa gcg 880 Ala Pro Leu Leu Ile His Phe Thr Pro Arg Ser Pro Gly Ser Glu Ala 175 180 185 gag gcg gag aca ggt ggt gtc agg gcg tcc tct cgc cag gcc gca ggc 928 Glu Ala Glu Thr Gly Gly Val Arg Ala Ser Ser Arg Gln Ala Ala Gly 190 195 200 ccc gcg ggg caa cag gac act ggc ccc tgg cag gcg ggc gcg ggg ccc 976 Pro Ala Gly Gln Gln Asp Thr Gly Pro Trp Gln Ala Gly Ala Gly Pro 205 210 215 220 tcg ggc tcg atg ggg aga ggc cgg ggc cgg cgg cgg cgg atg gac gct 1024 Ser Gly Ser Met Gly Arg Gly Arg Gly Arg Arg Arg Arg Met Asp Ala 225 230 235 ggc tcg gga gac aga gcc cgc cgc ccc cgg aaa ctg gac ccg ctc cgc 1072 Gly Ser Gly Asp Arg Ala Arg Arg Pro Arg Lys Leu Asp Pro Leu Arg 240 245 250 ctg ggc gcc gcg ggg agc gtg tgg gac gcg gtg gac ggg gcc gcc gcc 1120 Leu Gly Ala Ala Gly Ser Val Trp Asp Ala Val Asp Gly Ala Ala Ala 255 260 265 ctg gac gcc cac gcg cgc ggc ctc ccc aca gga ccc cca ctc gcc cag 1168 Leu Asp Ala His Ala Arg Gly Leu Pro Thr Gly Pro Pro Leu Ala Gln 270 275 280 gag ccc gca ctc ccg gcg ctg ccc gct ccc cgc gcc ctg cag cct ggg 1216 Glu Pro Ala Leu Pro Ala Leu Pro Ala Pro Arg Ala Leu Gln Pro Gly 285 290 295 300 tct cag acg gaa ggc tct ggg gcc aag ggt ggc tgg agc agg gag gcc 1264 Ser Gln Thr Glu Gly Ser Gly Ala Lys Gly Gly Trp Ser Arg Glu Ala 305 310 315 tcg ggg gtc cct gcc ccc gga gga ggc tgg ccc tgg gtc agc agg gag 1312 Ser Gly Val Pro Ala Pro Gly Gly Gly Trp Pro Trp Val Ser Arg Glu 320 325 330 gtc ccg ggc acc cgg agc ttt ggc cca gcc cca ggc tcc acg cgc ccc 1360 Val Pro Gly Thr Arg Ser Phe Gly Pro Ala Pro Gly Ser Thr Arg Pro 335 340 345 tgg cta gag agt ccg cct caa ggt cgc cca ctc tcg tcc caa ggg ccg 1408 Trp Leu Glu Ser Pro Pro Gln Gly Arg Pro Leu Ser Ser Gln Gly Pro 350 355 360 ggt gcc aca ggg gcc tac gat gcc ggc gag gcc ggg gct gac agc tcc 1456 Gly Ala Thr Gly Ala Tyr Asp Ala Gly Glu Ala Gly Ala Asp Ser Ser 365 370 375 380 cga gat aac agt cct gcc gct gac ctg ggg ccc acc cgg ccc ccg gag 1504 Arg Asp Asn Ser Pro Ala Ala Asp Leu Gly Pro Thr Arg Pro Pro Glu 385 390 395 caa gca aag ccg gcc gca gcc ggc cac agc cgc gcg ccc tcc cgg agc 1552 Gln Ala Lys Pro Ala Ala Ala Gly His Ser Arg Ala Pro Ser Arg Ser 400 405 410 cgt gag cct cgc ccg cgc tcc gcc tcc ccg ccc gca gct ccc ggc ccg 1600 Arg Glu Pro Arg Pro Arg Ser Ala Ser Pro Pro Ala Ala Pro Gly Pro 415 420 425 ggg ttc cca cct gaa gcc ctg act ctc ccc tct cct tca gac ttt ttg 1648 Gly Phe Pro Pro Glu Ala Leu Thr Leu Pro Ser Pro Ser Asp Phe Leu 430 435 440 ccc ctg gag gtt acc cag gat cct tcc gtg ggc gaa aat ctc aga gcg 1696 Pro Leu Glu Val Thr Gln Asp Pro Ser Val Gly Glu Asn Leu Arg Ala 445 450 455 460 gcg cca gcc cca agt tca gcc tca gcc caa gtc tta act tca gct cca 1744 Ala Pro Ala Pro Ser Ser Ala Ser Ala Gln Val Leu Thr Ser Ala Pro 465 470 475 gcc tca gtc cta gcc cca gcc ctg gct tca tcc ccc agc tca gca cca 1792 Ala Ser Val Leu Ala Pro Ala Leu Ala Ser Ser Pro Ser Ser Ala Pro 480 485 490 acc tca gcc acc acc tca acc tca tcc ccc acc tca gcc cca gcc cca 1840 Thr Ser Ala Thr Thr Ser Thr Ser Ser Pro Thr Ser Ala Pro Ala Pro 495 500 505 gct cca acc tca gct cca act tca acc cca gcc cca gcc cca agt cca 1888 Ala Pro Thr Ser Ala Pro Thr Ser Thr Pro Ala Pro Ala Pro Ser Pro 510 515 520 gct gca gcc gca act cca gcc cca gcc cca gtc cca gtc cca acc ctc 1936 Ala Ala Ala Ala Thr Pro Ala Pro Ala Pro Val Pro Val Pro Thr Leu 525 530 535 540 aca ccc cca tcc cca gcc cta acc cca gtc cca acc cca gcc cta agc 1984 Thr Pro Pro Ser Pro Ala Leu Thr Pro Val Pro Thr Pro Ala Leu Ser 545 550 555 cca gct cca act cca gcc cca acc cca gcc gca tcc cca gcc cta acc 2032 Pro Ala Pro Thr Pro Ala Pro Thr Pro Ala Ala Ser Pro Ala Leu Thr 560 565 570 cca gtc cca acc cca gcc cta agc cca gct cca act cca gcc cta acc 2080 Pro Val Pro Thr Pro Ala Leu Ser Pro Ala Pro Thr Pro Ala Leu Thr 575 580 585 cca gcc gca tcc cca gcc cta acc cca gtc cca acc cca gcc cta agc 2128 Pro Ala Ala Ser Pro Ala Leu Thr Pro Val Pro Thr Pro Ala Leu Ser 590 595 600 cca gct cca act cca gcc cca acc cca gcc gca tcc cct gcc cca gcc 2176 Pro Ala Pro Thr Pro Ala Pro Thr Pro Ala Ala Ser Pro Ala Pro Ala 605 610 615 620 ccc acc tca gcc cca acc cca acc cca gcc gca tcc cct gcc cca gct 2224 Pro Thr Ser Ala Pro Thr Pro Thr Pro Ala Ala Ser Pro Ala Pro Ala 625 630 635 gac ggg tca aag cct cag gag agt gtg gct ctc ccc agg cgc tac cag 2272 Asp Gly Ser Lys Pro Gln Glu Ser Val Ala Leu Pro Arg Arg Tyr Gln 640 645 650 gag ggg cag gtc tca gcc agc tgg gga aac ctt att gcc atg gtt ctt 2320 Glu Gly Gln Val Ser Ala Ser Trp Gly Asn Leu Ile Ala Met Val Leu 655 660 665 aga agc cac ccc ttc ccc agg caa gac agg ccc caa ggg agt gtc ccg 2368 Arg Ser His Pro Phe Pro Arg Gln Asp Arg Pro Gln Gly Ser Val Pro 670 675 680 agg gcg gtt ccc ggg agc ccc gtg ggt ccc agc act tcc aca cac tct 2416 Arg Ala Val Pro Gly Ser Pro Val Gly Pro Ser Thr Ser Thr His Ser 685 690 695 700 gag gac aga cac ggc ccc tct tct tca gtg ggg aca gtc ata ggg aca 2464 Glu Asp Arg His Gly Pro Ser Ser Ser Val Gly Thr Val Ile Gly Thr 705 710 715 ggt aca ggg ggc ctg gtt gag gct gga ggt cag cca cag cca aga agc 2512 Gly Thr Gly Gly Leu Val Glu Ala Gly Gly Gln Pro Gln Pro Arg Ser 720 725 730 tcc gag acc aac gga tcg ccc agc cca gac cct ccc cca ggc cta aga 2560 Ser Glu Thr Asn Gly Ser Pro Ser Pro Asp Pro Pro Pro Gly Leu Arg 735 740 745 gga gag gga acc agg gag aaa agt cta gac ccg ctg ccc caa gcc gcg 2608 Gly Glu Gly Thr Arg Glu Lys Ser Leu Asp Pro Leu Pro Gln Ala Ala 750 755 760 atg ccc agg ggc ccc gca cag ccc ccc gcg cag agg ccg cct ggc ccc 2656 Met Pro Arg Gly Pro Ala Gln Pro Pro Ala Gln Arg Pro Pro Gly Pro 765 770 775 780 gcg gcc tcc tcc tct gcg agg cgc tca cag ccg gta ccc cag cta cgg 2704 Ala Ala Ser Ser Ser Ala Arg Arg Ser Gln Pro Val Pro Gln Leu Arg 785 790 795 aaa cgc agc agg tgc gaa atc gcc ccg agc tcg gag cag gag gtc agg 2752 Lys Arg Ser Arg Cys Glu Ile Ala Pro Ser Ser Glu Gln Glu Val Arg 800 805 810 ccg gcc gcc tcg ggg gac cct caa ggg gag gcg ccg ggg gag ggg ggc 2800 Pro Ala Ala Ser Gly Asp Pro Gln Gly Glu Ala Pro Gly Glu Gly Gly 815 820 825 agc cct gcc ggc cgc agc tgg gcg ctc acg gaa aag cag gag gag gcc 2848 Ser Pro Ala Gly Arg Ser Trp Ala Leu Thr Glu Lys Gln Glu Glu Ala 830 835 840 cgg aag ctc atg gtg ttt ctg cag agg ccc ggg ggt tgg ggg ttg gtg 2896 Arg Lys Leu Met Val Phe Leu Gln Arg Pro Gly Gly Trp Gly Leu Val 845 850 855 860 gag ggg ccc cgg aag ccc agc tcc cgg gcc ctg gag ccc gcc acg gcg 2944 Glu Gly Pro Arg Lys Pro Ser Ser Arg Ala Leu Glu Pro Ala Thr Ala 865 870 875 gca gcc ctg cgg cgg cgg ctg gac ctg ggc agt tgc ctg gac gtg ctg 2992 Ala Ala Leu Arg Arg Arg Leu Asp Leu Gly Ser Cys Leu Asp Val Leu 880 885 890 gcc ttt gcc cgg cag cac gga gag ccc ggc ctg gcg cag gag acc tac 3040 Ala Phe Ala Arg Gln His Gly Glu Pro Gly Leu Ala Gln Glu Thr Tyr 895 900 905 gcg ctg atg agc gac aac ctg ctg cga gtg ctg gga gac ccg tgc ctc 3088 Ala Leu Met Ser Asp Asn Leu Leu Arg Val Leu Gly Asp Pro Cys Leu 910 915 920 tac cgc cgg ctg agc gcg gcc gac cgc gag cgc atc ctc agc ctg cgg 3136 Tyr Arg Arg Leu Ser Ala Ala Asp Arg Glu Arg Ile Leu Ser Leu Arg 925 930 935 940 acc ggc cgg ggc cgg gcg gtg ctg ggc gtc ctc gta ctg ccc agc ctc 3184 Thr Gly Arg Gly Arg Ala Val Leu Gly Val Leu Val Leu Pro Ser Leu 945 950 955 tac cag ggg ggc cgc tca ggg ctc ccc agg ggc cct cgt ggc gag gag 3232 Tyr Gln Gly Gly Arg Ser Gly Leu Pro Arg Gly Pro Arg Gly Glu Glu 960 965 970 cct cct gcg gcg gcc cct gtg tcc ctg cct cta cct gcg cac ctg cat 3280 Pro Pro Ala Ala Ala Pro Val Ser Leu Pro Leu Pro Ala His Leu His 975 980 985 gtg ttc aac ccc cgg gag aac acc tgg cgg ccc ctg acc cag gtg ccc 3328 Val Phe Asn Pro Arg Glu Asn Thr Trp Arg Pro Leu Thr Gln Val Pro 990 995 1000 gag gag gcc ccg ctt cgg ggc tgc ggt ctc tgc acc atg cac aac 3373 Glu Glu Ala Pro Leu Arg Gly Cys Gly Leu Cys Thr Met His Asn 1005 1010 1015 tac ctg ttt ctg gcg ggg ggc atc cgt ggc tcc ggt gcc aag gcc 3418 Tyr Leu Phe Leu Ala Gly Gly Ile Arg Gly Ser Gly Ala Lys Ala 1020 1025 1030 gtc tgc tcc aac gag gtc ttc tgc tac aac cct ctg acc aac atc 3463 Val Cys Ser Asn Glu Val Phe Cys Tyr Asn Pro Leu Thr Asn Ile 1035 1040 1045 tgg agc cag gtt cgg ccc atg cag cag gcc cga gcc cag ctc aag 3508 Trp Ser Gln Val Arg Pro Met Gln Gln Ala Arg Ala Gln Leu Lys 1050 1055 1060 ctg gtg gcc ctg gac ggg ctg ctc tat gcc atc ggt ggc gaa tgc 3553 Leu Val Ala Leu Asp Gly Leu Leu Tyr Ala Ile Gly Gly Glu Cys 1065 1070 1075 ctg tac agc atg gag tgc tac gac ccg cga aca gac gcc tgg acc 3598 Leu Tyr Ser Met Glu Cys Tyr Asp Pro Arg Thr Asp Ala Trp Thr 1080 1085 1090 cca cgc gcg cca ctc ccc gca ggc acc ttc cct gtg gcc cac gag 3643 Pro Arg Ala Pro Leu Pro Ala Gly Thr Phe Pro Val Ala His Glu 1095 1100 1105 gct gtg gcc tgc cgt ggg gac atc tac gtc acc ggg ggt cac ctc 3688 Ala Val Ala Cys Arg Gly Asp Ile Tyr Val Thr Gly Gly His Leu 1110 1115 1120 ttc tac cgc ctg ctc agg tac agc ccc gtg aag gat gct tgg gac 3733 Phe Tyr Arg Leu Leu Arg Tyr Ser Pro Val Lys Asp Ala Trp Asp 1125 1130 1135 gag tgc cca tac agt gcc agc cac cgg cgt tcc agc gac atc gtg 3778 Glu Cys Pro Tyr Ser Ala Ser His Arg Arg Ser Ser Asp Ile Val 1140 1145 1150 gca ctg ggg ggc ttc ctg tac cgc ttc gac ctg ctg cgg ggc gtg 3823 Ala Leu Gly Gly Phe Leu Tyr Arg Phe Asp Leu Leu Arg Gly Val 1155 1160 1165 ggc gcc gcc gtg atg cgc tac aac aca gtg acc ggc tcc tgg agc 3868 Gly Ala Ala Val Met Arg Tyr Asn Thr Val Thr Gly Ser Trp Ser 1170 1175 1180 agg gct gcc tcc ctg ccc ctg ccc gcc ccc gcc cca ctg cgc tgc 3913 Arg Ala Ala Ser Leu Pro Leu Pro Ala Pro Ala Pro Leu Arg Cys 1185 1190 1195 acc acc ctg ggc aac acc att tac tgc ctc aac ccc cag gtc act 3958 Thr Thr Leu Gly Asn Thr Ile Tyr Cys Leu Asn Pro Gln Val Thr 1200 1205 1210 gcc acc ttc acg gtc tct ggg ggg act gcc cag ttc cag gcc aag 4003 Ala Thr Phe Thr Val Ser Gly Gly Thr Ala Gln Phe Gln Ala Lys 1215 1220 1225 gag ctg cag ccc ttc ccc ttg ggg agc acc ggg gtc ctc agt cca 4048 Glu Leu Gln Pro Phe Pro Leu Gly Ser Thr Gly Val Leu Ser Pro 1230 1235 1240 ttc atc ctg act ctg ccc cct gag gac cgg ctg cag acc tca ctc 4093 Phe Ile Leu Thr Leu Pro Pro Glu Asp Arg Leu Gln Thr Ser Leu 1245 1250 1255 tga gtggcaggca gagaaccaaa gctgcttcgc tgctctccag ggagaccctc 4146 ctgggatggg cctgagaggc cgggactcag ggaaggggct gggatcggaa cttcctgctc 4206 ttgtttctgg acaactttcc ccttctgctt taaaggttgt cgattatttt gaagcccaga 4266 ctccctcagc ctctttctgc ccctcactcc acacccagac tgtttcctga ctcaattccg 4326 tacctactta cagaccctct cagcttgctg acacccccct gtctgtggga ctccctattc 4386 cctagagcca gggactgatg tgtccccaca gacaaggact tggctcgctg gagctctgct 4446 gagccgagag aggagggggt agaaaacatt cacacttcct atgctctgtc agcaggacag 4506 ggagcaaaaa cgtccccagg caacgccctc gcctctgtga ctttctgcct gtcctaaggc 4566 ctccccaggt accaaccccg tagctatctg ggtctgtttg gcactgtgga ttctcaaggg 4626 cctagaaccc ttgcctctga aactggtccg ctggtgcagc cctgctgtct gcagctcctg 4686 cccatacccc cagcccacac caggccaggc ccactccggg ctcaccaccc tctgcagcct 4746 tgtggggctc tcccagcccc tccagaagcc caccccactt ctcgccaacc cccgatctct 4806 aaatgaggcc tgagcgtcac cctagttctg ccccttttta gctgtgtaga cttggacgag 4866 acatttgact tccctttctc cttgtctata aaatgtggac agtggacgtc tgtcacccaa 4926 gagagttgtg ggagacaaga tcacagctat gagcacctcg cacggtgtcc aggatgcaca 4986 gcacaatcca tgatgtgttt tctcccctta cgcactttga aacccatgct agaaaggtga 5046 atacatctga ctgtgctcca ctccaacctc cagcctggat gtccctgtct gggccctttt 5106 tctgtttttt attctatgtt cagcaccact ggcaccaaat acattttaat tcaccgaaag 5166 caaaaaaaaa aaaaaaaa 5184 8 1259 PRT Homo sapiens 8 Met Ile Gln Gly Thr Leu Glu Pro Asp Gly Pro Leu Trp Gly Trp Asp 1 5 10 15 Trp Asp Ser Asp Asn Asp Trp Asp Ser Ala Val Leu Ala Leu Leu Ala 20 25 30 Leu Ala Val Val Ala Ala Thr Ala Leu Ala Leu His Trp Phe Gly Ser 35 40 45 Gly His Asp Gln Glu Ala Ala Glu Pro Val Ser Thr Ala Leu Gly Ala 50 55 60 Gln Pro His Gln Ala Gly Gly Ala Glu Leu Ala Leu Gln Pro Lys Ser 65 70 75 80 Lys Val Ser Asp Gly Ser Glu Gly Gln Ser Pro Gly Gln Gly Lys Pro 85 90 95 Glu Pro Pro Gly Arg Gly Gln Gln Ser Pro Val Pro Ala Ala Ala Pro 100 105 110 Gly Gly Gly Leu Ala Ala Met Ala Arg Leu Pro Leu Lys Thr Ala Val 115 120 125 Glu Glu Ala Arg Arg Glu Ala Leu Gly Gln Gln Arg Gly Ser Ala Thr 130 135 140 Pro Ala Ala Pro Arg Ala Glu Gly Lys Glu Pro Pro Arg Pro Gly Thr 145 150 155 160 Ala Leu Leu Gly Arg Ser Glu Ala Gly Gly Met Ser Ala Pro Leu Leu 165 170 175 Ile His Phe Thr Pro Arg Ser Pro Gly Ser Glu Ala Glu Ala Glu Thr 180 185 190 Gly Gly Val Arg Ala Ser Ser Arg Gln Ala Ala Gly Pro Ala Gly Gln 195 200 205 Gln Asp Thr Gly Pro Trp Gln Ala Gly Ala Gly Pro Ser Gly Ser Met 210 215 220 Gly Arg Gly Arg Gly Arg Arg Arg Arg Met Asp Ala Gly Ser Gly Asp 225 230 235 240 Arg Ala Arg Arg Pro Arg Lys Leu Asp Pro Leu Arg Leu Gly Ala Ala 245 250 255 Gly Ser Val Trp Asp Ala Val Asp Gly Ala Ala Ala Leu Asp Ala His 260 265 270 Ala Arg Gly Leu Pro Thr Gly Pro Pro Leu Ala Gln Glu Pro Ala Leu 275 280 285 Pro Ala Leu Pro Ala Pro Arg Ala Leu Gln Pro Gly Ser Gln Thr Glu 290 295 300 Gly Ser Gly Ala Lys Gly Gly Trp Ser Arg Glu Ala Ser Gly Val Pro 305 310 315 320 Ala Pro Gly Gly Gly Trp Pro Trp Val Ser Arg Glu Val Pro Gly Thr 325 330 335 Arg Ser Phe Gly Pro Ala Pro Gly Ser Thr Arg Pro Trp Leu Glu Ser 340 345 350 Pro Pro Gln Gly Arg Pro Leu Ser Ser Gln Gly Pro Gly Ala Thr Gly 355 360 365 Ala Tyr Asp Ala Gly Glu Ala Gly Ala Asp Ser Ser Arg Asp Asn Ser 370 375 380 Pro Ala Ala Asp Leu Gly Pro Thr Arg Pro Pro Glu Gln Ala Lys Pro 385 390 395 400 Ala Ala Ala Gly His Ser Arg Ala Pro Ser Arg Ser Arg Glu Pro Arg 405 410 415 Pro Arg Ser Ala Ser Pro Pro Ala Ala Pro Gly Pro Gly Phe Pro Pro 420 425 430 Glu Ala Leu Thr Leu Pro Ser Pro Ser Asp Phe Leu Pro Leu Glu Val 435 440 445 Thr Gln Asp Pro Ser Val Gly Glu Asn Leu Arg Ala Ala Pro Ala Pro 450 455 460 Ser Ser Ala Ser Ala Gln Val Leu Thr Ser Ala Pro Ala Ser Val Leu 465 470 475 480 Ala Pro Ala Leu Ala Ser Ser Pro Ser Ser Ala Pro Thr Ser Ala Thr 485 490 495 Thr Ser Thr Ser Ser Pro Thr Ser Ala Pro Ala Pro Ala Pro Thr Ser 500 505 510 Ala Pro Thr Ser Thr Pro Ala Pro Ala Pro Ser Pro Ala Ala Ala Ala 515 520 525 Thr Pro Ala Pro Ala Pro Val Pro Val Pro Thr Leu Thr Pro Pro Ser 530 535 540 Pro Ala Leu Thr Pro Val Pro Thr Pro Ala Leu Ser Pro Ala Pro Thr 545 550 555 560 Pro Ala Pro Thr Pro Ala Ala Ser Pro Ala Leu Thr Pro Val Pro Thr 565 570 575 Pro Ala Leu Ser Pro Ala Pro Thr Pro Ala Leu Thr Pro Ala Ala Ser 580 585 590 Pro Ala Leu Thr Pro Val Pro Thr Pro Ala Leu Ser Pro Ala Pro Thr 595 600 605 Pro Ala Pro Thr Pro Ala Ala Ser Pro Ala Pro Ala Pro Thr Ser Ala 610 615 620 Pro Thr Pro Thr Pro Ala Ala Ser Pro Ala Pro Ala Asp Gly Ser Lys 625 630 635 640 Pro Gln Glu Ser Val Ala Leu Pro Arg Arg Tyr Gln Glu Gly Gln Val 645 650 655 Ser Ala Ser Trp Gly Asn Leu Ile Ala Met Val Leu Arg Ser His Pro 660 665 670 Phe Pro Arg Gln Asp Arg Pro Gln Gly Ser Val Pro Arg Ala Val Pro 675 680 685 Gly Ser Pro Val Gly Pro Ser Thr Ser Thr His Ser Glu Asp Arg His 690 695 700 Gly Pro Ser Ser Ser Val Gly Thr Val Ile Gly Thr Gly Thr Gly Gly 705 710 715 720 Leu Val Glu Ala Gly Gly Gln Pro Gln Pro Arg Ser Ser Glu Thr Asn 725 730 735 Gly Ser Pro Ser Pro Asp Pro Pro Pro Gly Leu Arg Gly Glu Gly Thr 740 745 750 Arg Glu Lys Ser Leu Asp Pro Leu Pro Gln Ala Ala Met Pro Arg Gly 755 760 765 Pro Ala Gln Pro Pro Ala Gln Arg Pro Pro Gly Pro Ala Ala Ser Ser 770 775 780 Ser Ala Arg Arg Ser Gln Pro Val Pro Gln Leu Arg Lys Arg Ser Arg 785 790 795 800 Cys Glu Ile Ala Pro Ser Ser Glu Gln Glu Val Arg Pro Ala Ala Ser 805 810 815 Gly Asp Pro Gln Gly Glu Ala Pro Gly Glu Gly Gly Ser Pro Ala Gly 820 825 830 Arg Ser Trp Ala Leu Thr Glu Lys Gln Glu Glu Ala Arg Lys Leu Met 835 840 845 Val Phe Leu Gln Arg Pro Gly Gly Trp Gly Leu Val Glu Gly Pro Arg 850 855 860 Lys Pro Ser Ser Arg Ala Leu Glu Pro Ala Thr Ala Ala Ala Leu Arg 865 870 875 880 Arg Arg Leu Asp Leu Gly Ser Cys Leu Asp Val Leu Ala Phe Ala Arg 885 890 895 Gln His Gly Glu Pro Gly Leu Ala Gln Glu Thr Tyr Ala Leu Met Ser 900 905 910 Asp Asn Leu Leu Arg Val Leu Gly Asp Pro Cys Leu Tyr Arg Arg Leu 915 920 925 Ser Ala Ala Asp Arg Glu Arg Ile Leu Ser Leu Arg Thr Gly Arg Gly 930 935 940 Arg Ala Val Leu Gly Val Leu Val Leu Pro Ser Leu Tyr Gln Gly Gly 945 950 955 960 Arg Ser Gly Leu Pro Arg Gly Pro Arg Gly Glu Glu Pro Pro Ala Ala 965 970 975 Ala Pro Val Ser Leu Pro Leu Pro Ala His Leu His Val Phe Asn Pro 980 985 990 Arg Glu Asn Thr Trp Arg Pro Leu Thr Gln Val Pro Glu Glu Ala Pro 995 1000 1005 Leu Arg Gly Cys Gly Leu Cys Thr Met His Asn Tyr Leu Phe Leu 1010 1015 1020 Ala Gly Gly Ile Arg Gly Ser Gly Ala Lys Ala Val Cys Ser Asn 1025 1030 1035 Glu Val Phe Cys Tyr Asn Pro Leu Thr Asn Ile Trp Ser Gln Val 1040 1045 1050 Arg Pro Met Gln Gln Ala Arg Ala Gln Leu Lys Leu Val Ala Leu 1055 1060 1065 Asp Gly Leu Leu Tyr Ala Ile Gly Gly Glu Cys Leu Tyr Ser Met 1070 1075 1080 Glu Cys Tyr Asp Pro Arg Thr Asp Ala Trp Thr Pro Arg Ala Pro 1085 1090 1095 Leu Pro Ala Gly Thr Phe Pro Val Ala His Glu Ala Val Ala Cys 1100 1105 1110 Arg Gly Asp Ile Tyr Val Thr Gly Gly His Leu Phe Tyr Arg Leu 1115 1120 1125 Leu Arg Tyr Ser Pro Val Lys Asp Ala Trp Asp Glu Cys Pro Tyr 1130 1135 1140 Ser Ala Ser His Arg Arg Ser Ser Asp Ile Val Ala Leu Gly Gly 1145 1150 1155 Phe Leu Tyr Arg Phe Asp Leu Leu Arg Gly Val Gly Ala Ala Val 1160 1165 1170 Met Arg Tyr Asn Thr Val Thr Gly Ser Trp Ser Arg Ala Ala Ser 1175 1180 1185 Leu Pro Leu Pro Ala Pro Ala Pro Leu Arg Cys Thr Thr Leu Gly 1190 1195 1200 Asn Thr Ile Tyr Cys Leu Asn Pro Gln Val Thr Ala Thr Phe Thr 1205 1210 1215 Val Ser Gly Gly Thr Ala Gln Phe Gln Ala Lys Glu Leu Gln Pro 1220 1225 1230 Phe Pro Leu Gly Ser Thr Gly Val Leu Ser Pro Phe Ile Leu Thr 1235 1240 1245 Leu Pro Pro Glu Asp Arg Leu Gln Thr Ser Leu 1250 1255 9 1053 DNA Homo sapiens CDS (1)..(1053) 9 atg ctc gaa ggt gcg gag ctg tac ttc aac gtg gac cat ggc tac ctg 48 Met Leu Glu Gly Ala Glu Leu Tyr Phe Asn Val Asp His Gly Tyr Leu 1 5 10 15 gag ggc ctg gtt cga gga tgc aag gcc agc ctc ctg acc cag caa gac 96 Glu Gly Leu Val Arg Gly Cys Lys Ala Ser Leu Leu Thr Gln Gln Asp 20 25 30 tat atc aac ctg gtc cag tgt gag acc cta gaa gac ctg aaa att cat 144 Tyr Ile Asn Leu Val Gln Cys Glu Thr Leu Glu Asp Leu Lys Ile His 35 40 45 ctc cag act act gat tat ggt aac ttt ttg gct aat cac aca aat cct 192 Leu Gln Thr Thr Asp Tyr Gly Asn Phe Leu Ala Asn His Thr Asn Pro 50 55 60 ctt act gtt tcc aaa att gac gct gag atg agg aag aga cta tgt gga 240 Leu Thr Val Ser Lys Ile Asp Ala Glu Met Arg Lys Arg Leu Cys Gly 65 70 75 80 gaa ttt gag tat ttc cgg aat cat tcc ctg gag ccc ctc agc aca ttt 288 Glu Phe Glu Tyr Phe Arg Asn His Ser Leu Glu Pro Leu Ser Thr Phe 85 90 95 ctc acc tat atg acg tgc agt tat atg ata gac aat gtg att ctg ctg 336 Leu Thr Tyr Met Thr Cys Ser Tyr Met Ile Asp Asn Val Ile Leu Leu 100 105 110 atg aat ggt gca ttg cag aaa aaa tct gtg aaa gaa att ctg ggg aag 384 Met Asn Gly Ala Leu Gln Lys Lys Ser Val Lys Glu Ile Leu Gly Lys 115 120 125 tgc cac ccc ttg ggc cgt ttc aca gaa atg gaa gct gtc aac att gca 432 Cys His Pro Leu Gly Arg Phe Thr Glu Met Glu Ala Val Asn Ile Ala 130 135 140 gag aca cct tca gat ctc ttt aat gcc att ctg atc gaa acg cca tta 480 Glu Thr Pro Ser Asp Leu Phe Asn Ala Ile Leu Ile Glu Thr Pro Leu 145 150 155 160 gct cca ttc ttc caa gac tgc atg tct gaa aat gct cta gat gaa ctg 528 Ala Pro Phe Phe Gln Asp Cys Met Ser Glu Asn Ala Leu Asp Glu Leu 165 170 175 aat att gaa ttg cta cgc aat aaa cta tac aag tct tac ctt gag gca 576 Asn Ile Glu Leu Leu Arg Asn Lys Leu Tyr Lys Ser Tyr Leu Glu Ala 180 185 190 ttc tat aaa ttc tgt aag aat cat ggt gat gtc aca gca gaa gtt atg 624 Phe Tyr Lys Phe Cys Lys Asn His Gly Asp Val Thr Ala Glu Val Met 195 200 205 tgt ccc att ctt gag ttt gag gcc gac aga cgt gct ttt atc atc act 672 Cys Pro Ile Leu Glu Phe Glu Ala Asp Arg Arg Ala Phe Ile Ile Thr 210 215 220 ctt aac tcc ttt ggc act gaa ttg agc aaa gaa gac cga gag acc ctc 720 Leu Asn Ser Phe Gly Thr Glu Leu Ser Lys Glu Asp Arg Glu Thr Leu 225 230 235 240 tat cca acc ttc ggc aaa ctc tat cct gag ggg ttg cgg ctg ttg gct 768 Tyr Pro Thr Phe Gly Lys Leu Tyr Pro Glu Gly Leu Arg Leu Leu Ala 245 250 255 caa gca gaa gac ttt gac cag atg aag aac gta gcg gat cat tac gga 816 Gln Ala Glu Asp Phe Asp Gln Met Lys Asn Val Ala Asp His Tyr Gly 260 265 270 gta tac aaa cct tta ttt gaa gct rta ggt ggc agt ggg gga aag aca 864 Val Tyr Lys Pro Leu Phe Glu Ala Xaa Gly Gly Ser Gly Gly Lys Thr 275 280 285 ttg gag gac gtg ttt tac gag cgt gag gta caa atg aat gtg ctg gca 912 Leu Glu Asp Val Phe Tyr Glu Arg Glu Val Gln Met Asn Val Leu Ala 290 295 300 ttc aac aga cag ttc cac tac ggt gtg ttt tat gca tat gta aag ctg 960 Phe Asn Arg Gln Phe His Tyr Gly Val Phe Tyr Ala Tyr Val Lys Leu 305 310 315 320 aag gaa cag gaa att aga aat att gtg tgg ata gca gaa tgt att tca 1008 Lys Glu Gln Glu Ile Arg Asn Ile Val Trp Ile Ala Glu Cys Ile Ser 325 330 335 cag agg cat cga act aaa atc aac agt tac att cca att tta taa 1053 Gln Arg His Arg Thr Lys Ile Asn Ser Tyr Ile Pro Ile Leu 340 345 350 10 350 PRT Homo sapiens misc_feature (281)..(281) The ′Xaa′ at location 281 stands for Val, or Ile. 10 Met Leu Glu Gly Ala Glu Leu Tyr Phe Asn Val Asp His Gly Tyr Leu 1 5 10 15 Glu Gly Leu Val Arg Gly Cys Lys Ala Ser Leu Leu Thr Gln Gln Asp 20 25 30 Tyr Ile Asn Leu Val Gln Cys Glu Thr Leu Glu Asp Leu Lys Ile His 35 40 45 Leu Gln Thr Thr Asp Tyr Gly Asn Phe Leu Ala Asn His Thr Asn Pro 50 55 60 Leu Thr Val Ser Lys Ile Asp Ala Glu Met Arg Lys Arg Leu Cys Gly 65 70 75 80 Glu Phe Glu Tyr Phe Arg Asn His Ser Leu Glu Pro Leu Ser Thr Phe 85 90 95 Leu Thr Tyr Met Thr Cys Ser Tyr Met Ile Asp Asn Val Ile Leu Leu 100 105 110 Met Asn Gly Ala Leu Gln Lys Lys Ser Val Lys Glu Ile Leu Gly Lys 115 120 125 Cys His Pro Leu Gly Arg Phe Thr Glu Met Glu Ala Val Asn Ile Ala 130 135 140 Glu Thr Pro Ser Asp Leu Phe Asn Ala Ile Leu Ile Glu Thr Pro Leu 145 150 155 160 Ala Pro Phe Phe Gln Asp Cys Met Ser Glu Asn Ala Leu Asp Glu Leu 165 170 175 Asn Ile Glu Leu Leu Arg Asn Lys Leu Tyr Lys Ser Tyr Leu Glu Ala 180 185 190 Phe Tyr Lys Phe Cys Lys Asn His Gly Asp Val Thr Ala Glu Val Met 195 200 205 Cys Pro Ile Leu Glu Phe Glu Ala Asp Arg Arg Ala Phe Ile Ile Thr 210 215 220 Leu Asn Ser Phe Gly Thr Glu Leu Ser Lys Glu Asp Arg Glu Thr Leu 225 230 235 240 Tyr Pro Thr Phe Gly Lys Leu Tyr Pro Glu Gly Leu Arg Leu Leu Ala 245 250 255 Gln Ala Glu Asp Phe Asp Gln Met Lys Asn Val Ala Asp His Tyr Gly 260 265 270 Val Tyr Lys Pro Leu Phe Glu Ala Xaa Gly Gly Ser Gly Gly Lys Thr 275 280 285 Leu Glu Asp Val Phe Tyr Glu Arg Glu Val Gln Met Asn Val Leu Ala 290 295 300 Phe Asn Arg Gln Phe His Tyr Gly Val Phe Tyr Ala Tyr Val Lys Leu 305 310 315 320 Lys Glu Gln Glu Ile Arg Asn Ile Val Trp Ile Ala Glu Cys Ile Ser 325 330 335 Gln Arg His Arg Thr Lys Ile Asn Ser Tyr Ile Pro Ile Leu 340 345 350 11 1197 DNA Homo sapiens CDS (1)..(1194) 11 atg agg aag cgg acc gag ccc gtc gcc ttg gag cat gag cgc tgc gcc 48 Met Arg Lys Arg Thr Glu Pro Val Ala Leu Glu His Glu Arg Cys Ala 1 5 10 15 gcc gcg ggc tcg tcc tcc tcc ggc tcg gcc gcc gcg gcg ctg gac gcc 96 Ala Ala Gly Ser Ser Ser Ser Gly Ser Ala Ala Ala Ala Leu Asp Ala 20 25 30 gac tgc cgc ctg aag cag aac cta cgc ctg acg ggc ccg gcg gcg gct 144 Asp Cys Arg Leu Lys Gln Asn Leu Arg Leu Thr Gly Pro Ala Ala Ala 35 40 45 gag ccg cgc tgc gca gcc gac gcg gga atg aag cgg gcg ctg ggc agg 192 Glu Pro Arg Cys Ala Ala Asp Ala Gly Met Lys Arg Ala Leu Gly Arg 50 55 60 cga aag ggc gtg tgg ttg cgc ctg agg aag ata ctt ttc tgt gtt ttg 240 Arg Lys Gly Val Trp Leu Arg Leu Arg Lys Ile Leu Phe Cys Val Leu 65 70 75 80 ggg ttg tac att gcc att cca ttt ctc atc aaa cta tgt cct gga ata 288 Gly Leu Tyr Ile Ala Ile Pro Phe Leu Ile Lys Leu Cys Pro Gly Ile 85 90 95 cag gcc aaa ctg att ttc ttg aat ttc gta aga gtt ccc tat ttc att 336 Gln Ala Lys Leu Ile Phe Leu Asn Phe Val Arg Val Pro Tyr Phe Ile 100 105 110 gat ttg aaa aaa cca cag gat caa ggt ttg aat cac acg tgt aac tac 384 Asp Leu Lys Lys Pro Gln Asp Gln Gly Leu Asn His Thr Cys Asn Tyr 115 120 125 tac ctg cag cca gag gaa gac gtg acc att gga gtc tgg cac acc gtc 432 Tyr Leu Gln Pro Glu Glu Asp Val Thr Ile Gly Val Trp His Thr Val 130 135 140 cct gca gtc tgg tgg aag aac gcc caa ggc aaa gac cag atg tgg tat 480 Pro Ala Val Trp Trp Lys Asn Ala Gln Gly Lys Asp Gln Met Trp Tyr 145 150 155 160 gag gat gcc ttg gct tcc agc cac cct atc att ctg tac ctg cat ggg 528 Glu Asp Ala Leu Ala Ser Ser His Pro Ile Ile Leu Tyr Leu His Gly 165 170 175 aac gca ggt acc aga gga ggc gac cac cgc gtg gag ctt tac aag gtg 576 Asn Ala Gly Thr Arg Gly Gly Asp His Arg Val Glu Leu Tyr Lys Val 180 185 190 ctg agt tcc ctt ggt tac cat gtg gtc acc ttt gac tac aga ggt tgg 624 Leu Ser Ser Leu Gly Tyr His Val Val Thr Phe Asp Tyr Arg Gly Trp 195 200 205 ggt gac tca gtg gga acg cca tct gag cgg ggc atg acc tat gac gca 672 Gly Asp Ser Val Gly Thr Pro Ser Glu Arg Gly Met Thr Tyr Asp Ala 210 215 220 ctc cac gtt ttt gac tgg atc aaa gca aga agt ggt gac aac ccc gtg 720 Leu His Val Phe Asp Trp Ile Lys Ala Arg Ser Gly Asp Asn Pro Val 225 230 235 240 tac atc tgg ggc cac tct ctg ggc act ggc gtg gcg aca aat ctg gtg 768 Tyr Ile Trp Gly His Ser Leu Gly Thr Gly Val Ala Thr Asn Leu Val 245 250 255 cgg cgc ctc tgt gag cga gag acg cct cca gat gcc ctt ata ttg gaa 816 Arg Arg Leu Cys Glu Arg Glu Thr Pro Pro Asp Ala Leu Ile Leu Glu 260 265 270 tct cca ttc act aat atc cgc gaa gaa gct aag agc cat cca ttt tca 864 Ser Pro Phe Thr Asn Ile Arg Glu Glu Ala Lys Ser His Pro Phe Ser 275 280 285 gtg ata tat cga tac ttc cct ggg ttt gac tgg ttc ttc ctt gat cct 912 Val Ile Tyr Arg Tyr Phe Pro Gly Phe Asp Trp Phe Phe Leu Asp Pro 290 295 300 att aca agt agt gga att aaa ttt gca aat gat gaa aac gtg aag cac 960 Ile Thr Ser Ser Gly Ile Lys Phe Ala Asn Asp Glu Asn Val Lys His 305 310 315 320 atc tcc tgt ccc ctg ctc atc ctg cac gct gag gac gac ccg gtg gtg 1008 Ile Ser Cys Pro Leu Leu Ile Leu His Ala Glu Asp Asp Pro Val Val 325 330 335 ccc ttc cag ctt ggc aga aag ctc tat agc atc gcc gca cca gct cga 1056 Pro Phe Gln Leu Gly Arg Lys Leu Tyr Ser Ile Ala Ala Pro Ala Arg 340 345 350 agc ttc cga gat ttc aaa gtt cag ttt gtg ccc ttt cat tca gac ctt 1104 Ser Phe Arg Asp Phe Lys Val Gln Phe Val Pro Phe His Ser Asp Leu 355 360 365 ggc tac agg cac aaa tac att tac aag agc cct gag ctg cca cgg ata 1152 Gly Tyr Arg His Lys Tyr Ile Tyr Lys Ser Pro Glu Leu Pro Arg Ile 370 375 380 ctg agg gaa ttc ctg ggg aag tcg gag cct gag cac cag cac tga 1197 Leu Arg Glu Phe Leu Gly Lys Ser Glu Pro Glu His Gln His 385 390 395 12 398 PRT Homo sapiens 12 Met Arg Lys Arg Thr Glu Pro Val Ala Leu Glu His Glu Arg Cys Ala 1 5 10 15 Ala Ala Gly Ser Ser Ser Ser Gly Ser Ala Ala Ala Ala Leu Asp Ala 20 25 30 Asp Cys Arg Leu Lys Gln Asn Leu Arg Leu Thr Gly Pro Ala Ala Ala 35 40 45 Glu Pro Arg Cys Ala Ala Asp Ala Gly Met Lys Arg Ala Leu Gly Arg 50 55 60 Arg Lys Gly Val Trp Leu Arg Leu Arg Lys Ile Leu Phe Cys Val Leu 65 70 75 80 Gly Leu Tyr Ile Ala Ile Pro Phe Leu Ile Lys Leu Cys Pro Gly Ile 85 90 95 Gln Ala Lys Leu Ile Phe Leu Asn Phe Val Arg Val Pro Tyr Phe Ile 100 105 110 Asp Leu Lys Lys Pro Gln Asp Gln Gly Leu Asn His Thr Cys Asn Tyr 115 120 125 Tyr Leu Gln Pro Glu Glu Asp Val Thr Ile Gly Val Trp His Thr Val 130 135 140 Pro Ala Val Trp Trp Lys Asn Ala Gln Gly Lys Asp Gln Met Trp Tyr 145 150 155 160 Glu Asp Ala Leu Ala Ser Ser His Pro Ile Ile Leu Tyr Leu His Gly 165 170 175 Asn Ala Gly Thr Arg Gly Gly Asp His Arg Val Glu Leu Tyr Lys Val 180 185 190 Leu Ser Ser Leu Gly Tyr His Val Val Thr Phe Asp Tyr Arg Gly Trp 195 200 205 Gly Asp Ser Val Gly Thr Pro Ser Glu Arg Gly Met Thr Tyr Asp Ala 210 215 220 Leu His Val Phe Asp Trp Ile Lys Ala Arg Ser Gly Asp Asn Pro Val 225 230 235 240 Tyr Ile Trp Gly His Ser Leu Gly Thr Gly Val Ala Thr Asn Leu Val 245 250 255 Arg Arg Leu Cys Glu Arg Glu Thr Pro Pro Asp Ala Leu Ile Leu Glu 260 265 270 Ser Pro Phe Thr Asn Ile Arg Glu Glu Ala Lys Ser His Pro Phe Ser 275 280 285 Val Ile Tyr Arg Tyr Phe Pro Gly Phe Asp Trp Phe Phe Leu Asp Pro 290 295 300 Ile Thr Ser Ser Gly Ile Lys Phe Ala Asn Asp Glu Asn Val Lys His 305 310 315 320 Ile Ser Cys Pro Leu Leu Ile Leu His Ala Glu Asp Asp Pro Val Val 325 330 335 Pro Phe Gln Leu Gly Arg Lys Leu Tyr Ser Ile Ala Ala Pro Ala Arg 340 345 350 Ser Phe Arg Asp Phe Lys Val Gln Phe Val Pro Phe His Ser Asp Leu 355 360 365 Gly Tyr Arg His Lys Tyr Ile Tyr Lys Ser Pro Glu Leu Pro Arg Ile 370 375 380 Leu Arg Glu Phe Leu Gly Lys Ser Glu Pro Glu His Gln His 385 390 395 13 324 DNA Homo sapiens CDS (1)..(321) First open reading frame 13 atg gca ctg cct cag gcc cag ctc cac ctc ctc cct gca ccc tcc cag 48 Met Ala Leu Pro Gln Ala Gln Leu His Leu Leu Pro Ala Pro Ser Gln 1 5 10 15 cag ttg ctg cag atg aga gcg gcg atg gga cca ggg tcc aga gca atg 96 Gln Leu Leu Gln Met Arg Ala Ala Met Gly Pro Gly Ser Arg Ala Met 20 25 30 gag gct ccg tgc ctg gga gcg ggt tct gcc ggc tgc gga agg ata ggg 144 Glu Ala Pro Cys Leu Gly Ala Gly Ser Ala Gly Cys Gly Arg Ile Gly 35 40 45 gtg gca cag tca gcc gcc ttg ggg aca cag gac ata ggg gac cca cca 192 Val Ala Gln Ser Ala Ala Leu Gly Thr Gln Asp Ile Gly Asp Pro Pro 50 55 60 ccg cca ctg ccg ctc ccg cag cag ctc gtg ccg cca cca ccc acg cct 240 Pro Pro Leu Pro Leu Pro Gln Gln Leu Val Pro Pro Pro Pro Thr Pro 65 70 75 80 cct tgc tgc agc cag cgt gac gac agc ggc tgc tcc gga cca ccc gca 288 Pro Cys Cys Ser Gln Arg Asp Asp Ser Gly Cys Ser Gly Pro Pro Ala 85 90 95 agc ggc cat cac ggc aag gga cta acc gtc gtg taa 324 Ser Gly His His Gly Lys Gly Leu Thr Val Val 100 105 14 107 PRT Homo sapiens 14 Met Ala Leu Pro Gln Ala Gln Leu His Leu Leu Pro Ala Pro Ser Gln 1 5 10 15 Gln Leu Leu Gln Met Arg Ala Ala Met Gly Pro Gly Ser Arg Ala Met 20 25 30 Glu Ala Pro Cys Leu Gly Ala Gly Ser Ala Gly Cys Gly Arg Ile Gly 35 40 45 Val Ala Gln Ser Ala Ala Leu Gly Thr Gln Asp Ile Gly Asp Pro Pro 50 55 60 Pro Pro Leu Pro Leu Pro Gln Gln Leu Val Pro Pro Pro Pro Thr Pro 65 70 75 80 Pro Cys Cys Ser Gln Arg Asp Asp Ser Gly Cys Ser Gly Pro Pro Ala 85 90 95 Ser Gly His His Gly Lys Gly Leu Thr Val Val 100 105 15 260 DNA Homo sapiens CDS (9)..(257) Second open reading frame 15 gagcggcg atg gga cca ggg tcc aga gca atg gag gct ccg tgc ctg gga 50 Met Gly Pro Gly Ser Arg Ala Met Glu Ala Pro Cys Leu Gly 1 5 10 gcg ggt tct gcc ggc tgc gga agg ata ggg gtg gca cag tca gcc gcc 98 Ala Gly Ser Ala Gly Cys Gly Arg Ile Gly Val Ala Gln Ser Ala Ala 15 20 25 30 ttg ggg aca cag gac ata ggg gac cca cca ccg cca ctg ccg ctc ccg 146 Leu Gly Thr Gln Asp Ile Gly Asp Pro Pro Pro Pro Leu Pro Leu Pro 35 40 45 cag cag ctc gtg ccg cca cca ccc acg cct cct tgc tgc agc cag cgt 194 Gln Gln Leu Val Pro Pro Pro Pro Thr Pro Pro Cys Cys Ser Gln Arg 50 55 60 gac gac agc ggc tgc tcc gga cca ccc gca agc ggc cat cac ggc aag 242 Asp Asp Ser Gly Cys Ser Gly Pro Pro Ala Ser Gly His His Gly Lys 65 70 75 gga cta acc gtc gtg taa 260 Gly Leu Thr Val Val 80 16 83 PRT Homo sapiens 16 Met Gly Pro Gly Ser Arg Ala Met Glu Ala Pro Cys Leu Gly Ala Gly 1 5 10 15 Ser Ala Gly Cys Gly Arg Ile Gly Val Ala Gln Ser Ala Ala Leu Gly 20 25 30 Thr Gln Asp Ile Gly Asp Pro Pro Pro Pro Leu Pro Leu Pro Gln Gln 35 40 45 Leu Val Pro Pro Pro Pro Thr Pro Pro Cys Cys Ser Gln Arg Asp Asp 50 55 60 Ser Gly Cys Ser Gly Pro Pro Ala Ser Gly His His Gly Lys Gly Leu 65 70 75 80 Thr Val Val 

What is claimed:
 1. An isolated nucleic acid molecule selected from the group consisting of: (a) an isolated nucleic acid molecule that encodes the amino acid sequence of SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, or 16; (b) an isolated nucleic acid molecule which hybridizes under stringent conditions to the complement of SEQ ID NO: 1, 3, 5, 7, 9, 11, 13, or 15; (c) an isolated nucleic acid molecule which hybridizes under stringent conditions to the complement of a nucleic acid molecule that encodes the amino acid sequence of SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, or 16; (d) an isolated nucleic acid molecule that encodes a protein that exhibits at least about 90% amino acid sequence identity to SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, or 16; and (e) an isolated nucleic acid molecule comprising the complement of SEQ ID NO: 1, 3, 5, 7, 9, 11, 13, or
 15. 2. The isolated nucleic acid molecule of claim 1, wherein the nucleic acid molecule comprises nucleotides 70-1041 of SEQ ID NO:
 1. 3. The isolated nucleic acid molecule of claim 1, wherein the nucleic acid molecule consists of nucleotides 70-1038 of SEQ ID NO:
 1. 4. The isolated nucleic acid molecule of claim 1, wherein the nucleic acid molecule encodes the amino acid sequence of SEQ ID NO:
 2. 5. The isolated nucleic acid molecule of claim 1, wherein the nucleic acid molecule encodes the mature protein of MC21 or amino acid residues 39-323 of SEQ ID NO:
 2. 6. The isolated nucleic acid molecule of claim 1, wherein the nucleic acid molecule encodes the extracellular domain of MC21 or amino acid residues 39-278 of SEQ ID NO:
 2. 7. The isolated nucleic acid molecule of claim 1, wherein the nucleic acid molecule encodes the extracellular domain of MC21 contiguous with the transmembrane domain of MC21 or amino acid residues 39-278 of SEQ ID NO:
 2. 8. The isolated nucleic acid molecule of claim 1, wherein said nucleic acid molecule is operably linked to one or more expression control elements.
 9. A vector comprising an isolated nucleic acid molecule of claim
 1. 10. A host cell transformed to contain the nucleic acid molecule of claim
 1. 11. A host cell comprising the vector of claim
 9. 12. The host cell of claim 11, wherein said host is selected from the group consisting of prokaryotic host cells and eukaryotic host cells.
 13. A method for producing a polypeptide comprising culturing a host cell transformed with the nucleic acid molecule of claim 1 under conditions in which the protein encoded by said nucleic acid molecule is expressed.
 14. The method of claim 13, wherein said host cell is selected from the group consisting of prokaryotic host cells and eukaryotic host cells.
 15. An isolated polypeptide produced by the method of claim
 13. 16. An isolated antibody that binds to a polypeptide of claim
 15. 17. An isolated polypeptide or protein selected from the group consisting of an isolated polypeptide comprising the amino acid sequence of SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, or 16, an isolated polypeptide comprising a fragment of at least 6 amino acids of SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, or 16, an isolated polypeptide comprising conservative amino acid substitutions of SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, or 16 or naturally occurring amino acid sequence variants of SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, or 16, and an isolated polypeptide exhibiting at least about 90% amino acid sequence identity with SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, or
 16. 18. An isolated polypeptide or protein of claim 16, selected from the group consisting of an isolated polypeptide comprising the amino acid sequence of SEQ ID NO: 2, an isolated polypeptide comprising mature MC21 protein or amino acid residues 39-323 of SEQ ID NO: 2, an isolated polypeptide comprising MC21 extracellular domain amino acid residues 39-278 of SEQ ID NO: 2, an isolated polypeptide comprising the extracellular domain of MC21 contiguous to the transmembrane domain of MC21 protein, an isolated polypeptide comprising conservative amino acid substitutions of SEQ ID NO: 2 or naturally occurring amino acid sequence variants of SEQ ID NO: 2, and an isolated polypeptide exhibiting at least about 90% amino acid sequence identity with SEQ ID NO:
 2. 19. An isolated antibody that binds to a polypeptide of claim
 17. 20. An antibody of claim 19 wherein said antibody is a monoclonal or a polyclonal antibody.
 21. A method of identifying an agent which modulates the expression of a nucleic acid encoding a protein of claim 17, comprising: exposing cells which express the nucleic acid to the agent; and determining whether the agent modulates expression of said nucleic acid, thereby identifying an agent which modulates the expression of a nucleic acid encoding the protein.
 22. A method of identifying an agent which modulates at least one activity of a protein of claim 17, comprising: exposing cells which express the protein to the agent; determining whether the agent modulates at least one activity of said protein, thereby identifying an agent which modulates at least one activity of the protein.
 23. The method of claim 21, wherein the agent modulates one activity of the protein.
 24. A method of identifying binding partners for a protein of claim 17, comprising: exposing said protein to a potential binding partner; and determining if the potential binding partner binds to said protein, thereby identifying binding partners for the protein.
 25. A method of modulating the expression of a nucleic acid encoding a protein of claim 17, comprising: administering an effective amount of an agent which modulates the expression of a nucleic acid encoding the protein.
 26. A method of modulating at least one activity of a protein of claim 17, comprising: administering an effective amount of an agent which modulates at least one activity of the protein.
 27. A non-human transgenic animal modified to contain the nucleic acid molecule of claim
 1. 28. The transgenic animal of claim 27, wherein the nucleic acid molecule contains a mutation that prevents expression of the encoded protein.
 29. A method of diagnosing a disease state in a subject, comprising determining the level of expression of a nucleic acid molecule of claim
 1. 30. The method of claim 29, wherein the disease state is allergic hypersensitivity.
 31. The method of claim 29, wherein the disease state is seasonal rhinitis.
 32. The method of claim 29, wherein the disease state is asthma.
 33. The method of claim 29, wherein the disease state is urticaria or atopic dermatitis.
 34. The method of claim 29, wherein the disease state is mastocytosis.
 35. A composition comprising an isolated nucleic acid molecule of claim 1 and an aqueous carrier.
 36. A method of diagnosing a disease state in a subject, comprising determining the level of expression of a protein of claim
 17. 37. The method of claim 36, wherein the disease state is allergic hypersensitivity.
 38. The method of claim 36, wherein the disease state is seasonal rhinitis.
 39. The method of claim 36, wherein the disease state is asthma.
 40. The method of claim 36, wherein the disease state is urticaria or atopic dermatitis.
 41. The method of claim 37, wherein the disease state is mastocytosis. 